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(57) Abstract 

A new receptor family has been identified, of activin-like kinases. Novel proteins have activin/TGF-p-type I receptor func- 
tionality, and have consequential diagnostic/therapeutic utility. They may have a serine/threonine kinase domain, a DFKSRN or 
DLKSKN sequence in subdomain VI3 and/or a GTKRYM sequence in subdomain VIII. 
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ACTIVIN RECEPTOR-LIKE KINASES, PROTEINS HAVING 

SERINE THREONINE KINASE DOMAINS AND THEIR USE. 
************ 

Field of the Invention 

This invention relates to proteins having 
5 serine/threonine kinase domains, corresponding nucleic acid 
molecules, and their use. 
Background of the Invention 

The transforming growth factpr-B (TGF-6) superfamily 
consists of a family of structurally-related proteins, 
10 including three different mammalian isof orms of TGF-d (TGF- 
fll, 62 and B3) , activins, inhibins, milllerian-inhibiting 
substance and bone morphogenic proteins (BMPs) (for reviews 
see Roberts and Sporn, (1990) Peptide Growth Factors and 
Their Receptors, Pt.l, Sporn and Roberts, eds. (Berlin: 
15 Springer - Verlag) pp 419-472; Moses et al (1990) Cell §2., 
245-247). The proteins of the TGF-B superfamily have a 
wide variety of biological activities. TGF-B acts as a 
growth inhibitor for many cell types and appears to play a 
central role in the regulation of embryonic development, 
20 tissue regeneration, immuno-regulation, as well as in 
fibrosis and carcinogenesis (Roberts and Sporn (199) see 
above) . 

Activins and inhibins were originally identified as 
factors which regulate secretion of follicle-stimulating 

25 hormone secretion (Vale et al (1990) Peptide Growth Factors 
and Their Receptors, Pt.2, Sporn and Roberts, eds. (Berlin: 
Springer-Verlag) pp. 211-248). Activins were also shown to 
induce the differentiation of haematopoietic progenitor 
cells (Murata e£ al (1988) Proc. Natl. Acad. Sci. USA ££, 

30 2434 - 2438; Eto at al (1987) Biochem. Biophys. Res. 
Commun. 142, 1095-1103) and induce mesoderm formation in 
Xenopus embryos (Smith at &1 (1990) Nature 3J5_, 729-731; 
van den Eijnden-Van Raaij e£ al (1990) Nature 732- 
734). 

35 . BMPs or osteogenic proteins which induce the formation 

of bone and cartilage when implanted subcutaneous ly (Wozney 
&1 (1988) Science 242, 1528-1534), facilitate neuronal 
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differentiation (Paralkar fit fil (1992) J. Cell Biol. 112, 
1721-1728) and induce monocyte chemotaxis (Cunningham fit fil 
(1992) Proc. Natl. Acad. Sci. USA £2, 11740-11744). 
Mtillerian-inhibiting substance induces regression of the 
5 Mtillerian duct in the male reproductive system (Cate fit fil 
(1986) Cell AS, 685-698), and a glial cell line-derived 
neurotrophic factor enhances survival of midbrain 
dopaminergic neurons (Lin fit fil (1993) Science 2££, 1130- 
1132) . The action of these growth factors is mediated 

10 through binding to specific cell surface receptors. 

Within this family, TGF-fl receptors have been most 
thoroughly characterized. By covalently cross-linking 
radio-labelled TGF-B to cell surface molecules followed by 
polyacrylamide gel electrophoresis of the affinity-labelled 

15 complexes, three distinct size classes of cell surface 
proteins (in most cases) have been identified, denoted 
receptor type I (53 kd) , type II (75 kd) , type III or 
betaglycan (a 300 kd proteoglycan with a 120 kd core 
protein) (for a review see Massague (1992) Cell 12 1067- 

20 1070) and more recently endoglin (a homodimer of two 95 kd 
subunits) (Cheifetz fit fii (1992) J. Biol. Chem. 212 19027- 
19030) . Current evidence suggests that type I and type II 
receptors are directly involved in receptor signal 
transduction (Segarini fit fil (1989) Mol. Endo., 3_, 261-272; 

25 Laiho fit fil (1991) J. Biol. Chem. 2H, 9100-9112) and may 
form a heteromeric complex; the type II receptor is needed 
for the binding of TGF-B to the type I receptor and the 
type I receptor is needed for the signal transduction 
induced by the type II receptor (Wrana fit fil (1992) Cell, 

30 21/ 1003-1004). The type III receptor and endoglin may 
have more indirect roles, possibly by facilitating the 
binding of ligand to type II receptors (Wang fit fil (1991) 
Cell, 12 797-805; L6pez-Casillas fit fii (1993) Cell, 22 
1435-1444) . 

35 Binding analyses with activin A and BMP4 have led to 

the identification of two co-existing cross-linked affinity 
complexes of 50-60 kDa and 70-80 kDa on responsive cells 
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(Hino ££ fii (1989) J. Biol. Chem. 2&±, 10309 - 10314; 
Mathews and Vale (1991), Cell 775-785; Paralker e£ aJL 

(1991) Proc. Natl. Acad. Sci. USA £7, 8913-8917). By 
analogy with TGF-B receptors they are thought to be 

5 signalling receptors and have been named type I and type II 
receptors. 

Among the type II receptors for the TGF-B superfamily 
of proteins, the cDNA for the activin type II receptor (Act 
RII) was the first to be cloned (Mathews and Vale (1991) 

10 Cell £5, 973-982) . The predicted structure of the receptor 
was shown to be a transmembrane protein with an 
intracellular serine/threonine kinase domain. The activin 
receptor is related to the C. eleaans daf-1 gene product, 
but the ligand is currently unknown (Georgi e£ &1 (1990) 

15 Cell §±, 635-645) . Thereafter, another form of the activin 
type II receptor (activin type IIB receptor) , of which 
there are different splicing variants (Mathews e£ al 

(1992) , Science 225., 1702-1705; Attisano e£ al (1992) Cell 
68_, 97-108), and the TGF-B type II receptor (TBRII) (Lin e£ 

20 al (1992) Cell £&, 775-785) were cloned, both of which have 
putative serine/threonine kinase domains. 
Summary of the Invention 

The present invention involves the discovery of 
related novel peptides, including peptides having the 

25 activity of those defined herein as SEQ ID Nos. 2, 4, 8, 
10, 12, 14, 16 and 18. Their discovery is based on the 
realisation that receptor serine/threonine kinases form a 
new receptor family, which may include the type II 
receptors for other proteins in the TGF-B superfamily. To 

30 ascertain whether there were other members of this family 
of receptors, a protocol was designed to clone ActRII/ daf 
I related cDNAs. This approach made use of the polymerase 
chain reaction (PCR) , using degenerate primers based upon 
the amino-acid sequence similarity between kinase domains 

35 of the mouse activin type II receptor and daf -I gene 
products. 
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This strategy resulted in the isolation of a new 
family of receptor kinases called Activin receptor like 
kinases (ALK's) 1-6. These cDNAs showed an overall 33-39% 
sequence similarity with ActRII and TGF-B type II receptor 
5~ and 40r?2* sequence similarity towards each other in the 
kinase domains. 

Soluble receptors according to the invention comprise 
at least predominantly the extracellular domain. These can 
be selected from the information provided herein, prepared 
10 in conventional manner, and used in any manner associated 
with the invention. 

Antibodies to the peptides described herein may be 
raised in conventional manner. By selecting unique 
sequences of the peptides, antibodies having desired 
15 specificity can be obtained. 

The antibodies may be monoclonal, prepared in known 
manner. In particular, monoclonal antibodies to the 
extracellular domain are of potential value in therapy. 

Products of the invention are useful in diagnostic 
20 methods, e.g. to determine the presence in a sample for an 
analyte binding therewith, such as in an antagonist assay, 
conventional techniques, e.g. an enzyme-linked 
immunosorbent assay, may be used. 

Products of the invention having a specific receptor 
25 activity can be used in therapy, e.g. to modulate 
conditions associated with activin or TGF-0 activity. Such 
conditions include fibrosis, e.g. liver cirrhosis and 
pulmonary fibrosis, cancer, rheumatoid arthritis and 
glomeronephr itis . 
30 Brief Desc ription of the Drawings 

Figure 1 shows the alignment of the serine/threonine 
(S/T) kinase domains (I-VIII) of related receptors from 
transmembrane proteins, including embodiments of the 
present invention. The nomenclature of the subdomains is 
35 accordingly to Hanks et al (1988). 

Figures 2A to 2D shows the sequences and 
characteristics of the respective primers used in the 
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initial PGR reactions. The nucleic acid sequences are also 

given as SEQ ID Nos. 19 to 22. 

Figure 3 is a comparison of the amino-acid sequences 

of human activin type II receptor (Act R-II) , mouse activin 
5 type IIB receptor (Act R-IIB) , human TGF-6 type II receptor 

(TBR-II) , human TGF-A type I receptor (ALK-5) , human 

activin receptor type IA (ALK-2 ) , and type IB ( ALK-4 ) , ALKs 

1 & 3 and mouse ALK-6, 

Figure 4 shows, schematically, the structures for Daf - 
0 1, Act R-II, Act R-IIB, TBR-II, TBR-I/ALK-5, ALK's -1, -2 

(Act RIA) , -3, -4 (Act RIB) & -6. 

Figure 5 shows the sequence alignment of the cysteine- 

rich domains of the ALKs, TBR-II, Act R-II, Act R-IIB and 

daf -1 receptors. 
5 Figure 6 is a comparison of kinase domains of 

serine/threonine kinases, showing the percentage amino-acid 

identity of the kinase domains. 

Figure 7 shows the pairwise alignment relationship 

between the kinase domains of the receptor serine/ threonine 
0 kinases. The dendrogram was generated using the Jotun-Hein 

alignment program (Hein (1990) Meth. Enzymol. 183 . 626- 

645) . 

Brief Description of the Sequence Listings 

Sequences 1 and 2 are the nucleotide and deduced 
5 amino-acid sequences of cDNA for hALK-1 (clone HP57) . 

Sequences 3 and 4 are the nucleotide and deduced 
amino-acid sequences of cDNA for hALK-2 (clone HP53). 

Sequences 5 and 6 are the nucleotide and deduced 
amino-acid sequences of cDNA for hALK-3 (clone ONF5) . 
0 Sequences 7 and 8 the nucleotide and deduced amino- 

acid sequences of cDNA for hALK-4 (clone 11H8) , 
complemented with PCR product encoding extracellular 
domain. 

Sequences 9 and 10 are the nucleotide and deduced 
5 amino-acid sequences of cDNA for hALK-5 (clone EKBLA) • 

Sequences 11 and 12 are the nucleotide and deduced 
amino-acid sequences of cDNA for mALK-1 (clone AM6) . 
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Sequences 13 and 14 are the nucleotide and deduced 
amino-acid sequences of cDNA for mALK-3 (clones ME-7 and 
ME-D) . 

Sequences 15 and 16 are the nucleotide and deduced 
5 amino-acid sequences of cDNA for mALK-4 (clone 8al) . 

Sequences 17 and 18. are the nucleotide and deduced 
amino-acid sequences of cDNA for mALK-6 (clone ME-6) . 

Sequence 19 (Bl-S) is a sense primer, extracellular 
* domain, cysteine-rich region, BamHI site at 5' end, 28-mer, 
10 64-fold degeneracy. 

Sequence 20 (B3-S) is a sense primer, kinase domain 
II, BamHI site at 5' end, 25-mer, 162-fold degeneracy. 

Sequence 21 (B7-S) is a sense primer, kinase domain 
VIB, S/T kinase specific residues, BamHI site at 5' end, 
15 24-mer, 288-fold degeneracy. 

Sequence 22 (E8-AS) is an anti-sense primer, kinase 
domain, S/T kinase-specif ic residues EcoRI site at 5' end, 
20-mer, 18-fold degeneracy. 

Sequence 23 is an oligonucleotide probe. 
20 Sequence 24 is a 5' primer. 

Sequence 25 is a 3' primer. 

Sequence 26 is a consensus sequence in Subdomain I. 
Sequences 27 and 28 are novel sequence motifs in 
Subdomain VIB. 

25 Sequence 29 is a novel sequence motif in Subdomain 

VIII. 

Description of the Invention 

As described in more detail belov, nucleic acid 
sequences have been isolated, coding for a new sub-family 

30 of serine/ threonine receptor kinases. The term nucleic 
acid molecules as used herein refers to any sequence which 
codes for the murine, human or mammalian form, amino-acid 
sequences of which are presented herein. It is understood 
that the well known phenomenon of codon degeneracy provides 

35 for a great deal of sequence variation and all such 
varieties are included within the scope of this invention. 
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The nucleic acid sequences described herein may be 
used to clone the respective genomic DNA sequences in order 
to study the genes' structure and regulation. The murine 
and human cDNA or genomic sequences can also be used to 
5 isolate the homologous genes from other mammalian species. 
The mammalian DNA sequences can be used to study the 
receptors' functions in various in vitro and in vivo model 
systems. 

As exemplified below for ALK-5 cDNA, it is also 
10 recognised that, given the sequence information provided 
herein, the artisan could easily combine the molecules with 
a pertinent promoter in a vector, so as to produce a 
cloning vehicle for expression of the molecule. The 
promoter and coding molecule must be operably linked via 

15 any of the well-recognized and easily-practised 
methodologies for so doing. The resulting vectors, as well 
as the isolated nucleic acid molecules themselves, may be 
used to transform prokaryotic cells (e.g. £. coli ) . or 
transf ect eukaryotes such as yeast (£. cerevislae ) , pae, 

20 COS or CHO cell lines. Other appropriate expression 
systems will also be apparent to the skilled artisan. 

Several methods may be used to isolate the ligands for 
the ALKs. As shown for ALK-5 cDNA, cDNA clones encoding 
the active open reading frames can be subcloned into 

25 expression vectors and transf ected into eukaryotic cells, 
for example COS cells. The transf ected cells which can 
express the receptor can be subjected to binding assays for 
radioactively-labelled members of the TGF-6 superfamily 
(TGF-6, activins, inhibins, bone morphogenic proteins and 

30 mttllerian-inhibiting substances) , as it may be expected 
that the receptors will bind members of the TGP-6 
superfamily. Various biochemical or cell-based assays can 
be designed to identify the ligands, in tissue extracts or 
conditioned media, for receptors in which a ligand is not 

35 known. Antibodies raised to the receptors may also be used 
to identify the ligands, using the immunoprecipitation of 
the cross-linked complexes. Alternatively, purified 
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receptor could be used to isolate the ligands using an 
affinity-based approach. The determination of the 
expression patterns of the receptors may also aid in the 
isolation of the ligand. These studies may be carried out 
5 using ALK DNA or RNA sequences as probes to perform in situ 
hybridisation studies. 

The use of various model systems or structural studies 
should enable the rational development of specific agonists 
and antagonists useful in regulating receptor function. It 

10 may be envisaged that these can be peptides, mutated 
ligands, antibodies or other molecules able to interact 
with the receptors. 

The foregoing provides examples of the invention 
Applicants intend to claim which includes, inter alia, 

15 isolated nucleic acid molecules coding for activin 
receptor-like kinases (ALKs) , as defined herein. These 
include such sequences isolated from mammalian species such 
as mouse, human, rat, rabbit and monkey. 

The following description relates to specific 

20 embodiments. It will be understood that the specification 
and examples are illustrative but not limitative of the 
present invention and that other embodiments within the 
spirit and scope of the invention will suggest themselves 
to those skilled in the art. 

25 Preparation of mRNA and Construction of a cDNA Library 

For construction of a cDNA library, poly (A) 4 RNA was 
isolated from a human erythroleukemia cell line (HEL 
92.1.7) obtained from the American Type Culture Collection 
(ATCC TIB 180) • These cells were chosen as they have been 

30 shown to respond to both activin and TGF-B. Moreover 
leukaemic cells have proved to be rich sources for the 
cloning of novel receptor tyrosine kinases (Partanen £t &i 
(1990) Proc. Natl. Acad. Sci. USA fil, 8913-8917 and (1992) 
Mol. Cell. Biol. 12.# 1698-1707). (Total) RNA was prepared 

35 by the guanidinium isothiocyanate method (Chirgwin 

(1979) Biochemistry ifi, 5294-5299). mRNA was selected 
using the poly-A or poly AT tract mRNA isolation kit 
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(Pr omega, Madison, Wisconsin, U.S.A.) as described by the 
manufacturers, or purified through an oligo (dT) -cellulose 
column as described by Aviv and Leder (1972) Proc. Natl. 
Acad. Sci. USA ££, 1408-1412. ^ The isolated mRNA was used 
5 for the. synthesis of random primed (Amersham) cDNA, that 
was used to make a XgtlO library with IxlO 5 independent 
cDNA ciones using the Riboclone cDNA synthesis system 
(Promega) and XgtlO in vitro packaging kit (Amersham) 
according to the manufacturers' procedures. An amplified 

10 oligo (dT) primed human placenta AZAPII cDNA library of 
5xl0 5 independent clones was used. Poly (A)* RNA isolated 
from A61518 human foreskin fibroblasts was used to prepare 
a primary random primed AZAPII cDNA library of l.SxlO 6 
independent clones using the RiboClone cDNA synthesis 

15 system and Gigapack Gold II packaging extract (Stratagene) . 
In addition, a primary oligo (dT) primed human foreskin* 
fibroblast AgtlO cDNA library (Claesson-Welsh e£ £l (1989) 
Proc. Natl. Acad. Sci. USA. ££ 4917-4912) was prepared. An 
amplified oligo (dT) primed HEL cell Agtll cDNA library of 

20 1.5 X 10 6 independent clones (Poncz e£ si (1987) Blood 62 
219-223) was used. A twelve-day mouse embryo AE XIox cDNA 
library was obtained from Novagen (Madison, Wisconsin, 
U.S.A.); a mouse placenta A ZAP I I cDNA library was also 
used. 

25 Generation of cDNA Probes by PCR 

For the generation of cDNA probes by PCR (Lee e£ j&l 
(1988) Science 239 . 1288-1291) degenerate PCR primers were 
constructed based upon the amino-acid sequence similarity 
between the mouse activin type II receptor (Mathews and 

30 Vale (1991) Cell 973-982) and daf-1 (George e£ £i 

(1990) Cell 61, 635-645) in the kinase domains II and VIII. 
Figure 1 shows the aligned serine/threonine kinase domains 
(I-VIII) , of four related receptors of the TGF-B 
superfamily, i.e. hTBR-II, mActR-IIB, mActR-II and the daf - 

35 1 gene product, using the nomenclature of the subdomains 
according to Hanks g£ al (1988) Science 241 . 45-52. 
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Several considerations were applied in the design of 
the PGR primers* The sequences were taken from regions of 
homology between the activin type II receptor and the flal-l 
gene product, with particular emphasis on residues that 
5 confer serine/ threonine specificity (see Table 2) and on 
residues that are shared by transmembrane kinase proteins 
and not by cytoplasmic kinases. The primers were designed 
so that each primer of a PCR set had an approximately 
similar GC composition, and so that self complementarity 

10 and complementarity between the 3' ends of the primer sets 
were avoided. Degeneracy of the primers was kept as low as 
possible, in particular avoiding serine, leucine and 
arginine residues (6 possible codons) , and human codon 
preference was applied. Degeneracy was particularly 

15 avoided at the 3' end as, unlike the 5' end, where 
mismatches are tolerated, mismatches at the 3' end 
dramatically reduce the efficiency of PCR. 

In order to facilitate directional subcloning, 
restriction enzyme sites were included at the 5' end of the 

20 primers, with a GC clamp, which permits efficient 
restriction enzyme digestion. The primers utilised are 
shown in Figure 2. Oligonucleotides were synthesized using 
Gene assembler plus (Pharmacia - LKB) according to the 
manufacturers instructions. 

25 ' The mRNA prepared from HEL cells as described above 

was reverse-transcribed into cDNA in the presence of 50 mM 
Tris-HCl, pH 8.3, 8 mM HgCl 2 , 30 mM KC1, 10 mM 
dithiothreitol, 2mM nucleotide triphosphates, excess oligo 
(dT) primers and 34 units of AMV reverse transcriptase at 

30 42°C for 2 hours in 40 nl of reaction volume. 
Amplification by PCR was carried out with a 7.5* aliquot (3 
/xl) of the reverse-transcribed mRNA, in the presence of 10 
mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 M MgCl 2 , 0.01% gelatin, 
0.2 mM nucleotide triphosphates, 1 pM of both sense and 

35 antisense primers and 2.5 units of Tag polymerase (Perkin 
Elmer Cetus) in 100 pi reaction volume. Amplifications 
were performed on a thermal cycler (Perkin Elmer Cetus) 
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using the following program: first 5 thermal cycles with 
denaturation for 1 minute at 94°C, annealing for 1 minute 
at 50°C, a 2 minute ramp to 55°C and elongation for 1 minute 
^ at 72°C, followed by 20 cycles of 1 minute at 94°C, 30 
5 seconds at 55°C and 1 minute at 72°C. A second round of PCR 
was performed with 2 nl of the first reaction as a 
template. This involved 25 thermal cycles, each composed 
of 94°C (1 min), 5S°C (0.5 min) , 72°C (l min) . 

General procedures such as purification of nucleic 
10 acids, restriction enzyme digestion, gel electrophoresis, 
transfer of nucleic acid to solid supports and subcloning 
were performed essentially according to established 
procedures as described by Sambrook et al, (1989) , 
Molecular cloning: A Laboratory Manual, 2 nd Ed. Cold Spring 
15 Harbor Laboratory (Cold Spring Harbor, New York, USA) . 

Samples of the PCR products were digested with BamHI 
and jgcoRI and subsequently fractionated by low melting 
point agarose gel electrophoresis. Bands corresponding to 
the approximate expected sizes, (see Table 1: *460 bp for 
20 primer pair B3-S and E8-AS and * 14 0 bp for primer pair B7- 
S and E8-AS) were excised from the gel and the DNA was 
purified. Subsequently, these fragments were ligated into 
pUC19 (Yanisch-Perron ej£ si (1985) Gene 21, 103-119), which 
had been previously linearised with £amHI and Ecp.Rl and 
25 transformed into £. coli strain DH5o using standard 
protocols (Sambrook ££ aJL, supra ) . Individual clones were 
sequenced using standard double-stranded sequencing 
techniques and the dideoxynucleotide chain termination 
method as described by Sanger e£ al (1977) Proc. Natl. 
30 Acad. Sci. USA 21, 5463-5467, and T7 DNA polymerase. 

Employing Reverse Transcriptase PCR on HEL mRNA with 
the primer pair B3-S and E8-AS, three PCR products were 
obtained, termed 11. 1, 11.2 and 11.3, that corresponded to 
novel genes. Using the primer pair B7-S and E8-AS, an 
35 additional novel PCR product was obtained termed 5.2. 
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42 


11.2 
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460 


460 
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11.3 
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460 


460 
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48 
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460 
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800 


795 


100/ND 
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B7-S/E8-AS 


140 
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40/38 
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15 Tfiolation of cDNA Clones 

The PCR products obtained were used to screen various 
cDNA libraries described supxa. Labelling of the inserts 
of PCR products was performed using random priming method 
(Feinberg and Vogelstein (1983) Anal. Biochem, 121 6-13) 

20 using the Megaprime DNA labelling system (Amersham) . The 
oligonucleotide derived from the sequence of the PCR 
product 5.2 was labelled by phosphorylation with T4 
polynucleotide kinase following standard protocols 
(Sambrook e£ al, supra ) . Hybridization and purification of 

25 positive bacteriophages were performed using standard 
molecular biological techniques. 

The double-stranded DNA clones were all sequenced 
using the dideoxynucleotide chain-termination method as 
described by Sanger e£ si, supra, using T7 DNA polymerase 

30 (Pharmacia - LKB) or Sequenase (U.S. Biochemical 
Corporation, Cleveland, Ohio, U.S.A.). Compressions of 
nucleotides were resolved using 7-deaza-GTP (U.S. 
Biochemical Corp.) DNA sequences were analyzed using the 
DNA STAR computer program (DNA STAR Ltd. U.K.). Analyses 

35 of the sequences obtained revealed the existence of six 
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distinct putative receptor serine/threonine kinases which 
have been named ALK 1-6. 

To clone cDNA for ALK-1 the oligo (dT) primed human 
placenta cDNA library was screened with a radiolabeled 
5. insert derived from the PCR product 11.3; based upon their 
restriction enzyme digestion patterns, three different 
types of clones with approximate insert sizes, of 1.7 kb, 
2 kb & 3.5 kb were identified. - The 2 kb clone, named 
HP57, was chosen as representative of this class and 

10 subjected to complete sequencing. Sequence analysis of 
ALK-1 revealed a sequence of 1984 nucleotides including a 
poly -A tail (SEQ ZD No. 1) • The longest open reading frame 
encodes a protein of 503 amino-acids, with high sequence 
similarity to receptor serine/threonine kinases (see 

15 below) . The first methionine codon, the putative 
translation start site, is at nucleotide 283-285 and is 
preceded by an in-frame stop codon. This first ATG is in 
a more favourable context for translation initiation (Kozak 
(1987) Nucl. Acids Res., 15, 8125-8148) than the second and 

20 third in-frame ATG at nucleotides 316-318 and 325-327. The 
putative initiation codon is preceded by a 5' untranslated 
sequence of 282 nucleotides that is GC-rich (80% GC) , which 
is not uncommon for growth factor receptors (Kozak (1991) 
J. Cell Biol., 115 , 887-903). The 3' untranslated sequence 

25 comprises 193 nucleotides and ends with a poly-A tail. No 
bona fide poly-A addition signal is found, but there is a 
sequence (AATACA) , 17-22 nucleotides upstream of the poly-A 
tail, which may serve as a poly-A addition signal. 

ALK- 2 cDNA was cloned by screening an amplified oligo 

30 (dT) primed human placenta cDNA library with a 
radiolabeled insert derived from the PCR product 11.2. 
Two clones, termed HP53 and HP64, with insert sizes of 2.7 
kb and 2.4 kb respectively, were identified and their 
sequences were determined. No sequence difference in the 

35 overlapping clones was found, suggesting they are both 
derived from transcripts of the same gene. 
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Sequence analysis of cDNA clone HP53 (SEQ ID No. 3) 
revealed a sequence of 2719 nucleotides vith a poly-A tail. 
The longest open reading frame encodes a protein of 509 
amino-acids. The first ATG at nucleotides 104-106 agrees 
5- favourably with Kozak's consensus sequence vith an A at 
position 3. This ATG is preceded in-frame by a stop codon. 
There are four ATG codons in close proximity further 
downstream, which agree with the Kozak's consensus sequence 
(Kozak, pupra ) , but according to Kozak's scanning model the 
10 first ATG is predicted to be the translation start site. 
The 5' untranslated sequence is 103 nucleotides. The 3' 
untranslated sequence of 1089 nucleotides contains a 
polyadenylation signal located 9-14 nucleotides upstream 
from the poly-A tail. The cDNA clone HP64 lacks 498 
15 nucleotides from the 5' end compared to HP53, but the 
sequence extended at the 3' end with 190 nucleotides and 
poly-A tail is absent. This suggests that different 
polyadenylation sites occur for ALK-2. In Northern blots, 
however, only one transcript was detected (see below). 
20 The cDNA for human ALK-3 was cloned by initially 

screening an oligo (dT) primed human foreskin fibroblast 
cDNA library with an oligonucleotide (SEQ ID No. 23) 
derived from the PCR product 5.2. One positive cDNA clone 
with an insert size of 3 kb, termed ON11, was identified. 
25 However, upon partial sequencing, it appeared that this 
clone was incomplete; it encodes only part of the kinase 
domain and lacks the extracelluar domain. The most 5' 
sequence of ON11, a 540 nucleotide Xfeal restriction 
fragment encoding a truncated kinase domain, was 
30 subsequently used to probe a random primed fibroblast cDNA 
library from which one cDNA clone with an insert size of 3 
kb, termed 0NF5, was isolated (SEQ ID No. 5) • Sequence 
analysis of ONF5 revealed a sequence of 2932 nucleotides 
without a poly-A tail, suggesting that this clone was 
35 derived by internal priming. The longest open reading 
frame codes for a protein of 532 amino-acids. The first 
ATG codon which is compatible with Kozak's consensus 
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sequence (Kozak, supra ) . is at 310-312 nucleotides and is 
preceded by an in-frame stop codon. The 5' and 3' 
untranslated sequences are 309 and 1027 nucleotides long, 
respectively. 

5 ALK-4 cDNA was identified by screening a human oligo 

(dT) primed human erythroleukemia cDNA library with the 
radiolabeled insert of the PCR product 11.1 as a probe. 
One cDNA clone, termed 11H8, was identified with an insert 
size of 2 to (SEQ ID No. 7). An open reading frame was 
10 found encoding a protein sequence of 383 amino-acids 
encoding a truncated extracellular domain with high 
similarity to receptor serine/threonine kinases. The 3' 
untranslated sequence is 818 nucleotides and does not 
contain a poly-A tail, suggesting that the cDNA was 
15 internally primed. cDNA encoding the complete" 1 

extracellular domain (nucleotides 1-366) was obtained from 
HEL cells by RT-PCR with 5' primer (SEQ ID No. 24) derived 
in part from sequence at translation start site of SKR-2 (a 
cDNA sequence deposited in GenBank data base, accesion 
20 number L10125, that is identical in part to ALK-4) and 3' 
primer (SEQ ID No. 25) derived from 11H8 cDNA clone. 

ALK-5 was identified by screening the random primed 
HEL cell Igt 10 cDNA library with the PCR product 11. l as 
a probe. This yielded one positive clone termed EMBLA 
25 (insert size of 5.3 kb with 2 internal Eco.RI sites). 
Nucleotide sequencing revealed an open reading frame of 
1509 bp, coding for 503 amino-acids. The open reading 
frame was flanked by a 5' untranslated sequence of 76 bp, 
and a 3' untranslated sequence of 3.7 kb which was not 
30 completely sequenced. The nucleotide and deduced amino- 
acid sequences of ALK-5 are shown in SEQ ID Nos. 9 and 10. 
In the 5' part of the open reading frame, only one ATG 
codon was found; this codon fulfils the rules of 
translation initiation (Kozak, supra \ . An in-frame stop 
35 codon was found at nucleotides (-54) -(-52) in the 5' 
untranslated region. The predicted ATG start codon is 
followed by a stretch of hydrophobic amino-acid residues 
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which has characteristics of a cleavable signal sequence. 
Therefore, the first ATG codon is likely to be used as a 
translation initiation site. A preferred cleavage site for 
the signal peptidase, according to von Heijne (1986) Nucl. 
5 Acid. Res. 11, 4683-4690, is located between amino-acid 
residues 24 and 25. The calculated molecular miss of the 
primary translated product of the ALK-5 without signal 
sequence is 53,646 Da. 

Screening of the mouse embryo XEX I ox cDNA library 

10 using PCR, product 11.1 as a probe yielded 20 positive 
clones. DNAs from the positive clones obtained from this 
library were digested with EcoR I and Hind lll, 
electrophoretically separated on a 1.3% agarose gel and 
transferred to nitrocellulose filters according to 

15 established procedures as described by Sambrook e£ al . 
supra . The filters were then hybridized with specific 
probes for human ALK-1 (nucleotide 288-670) , ALK-2 
(nucleotide 1-581), ALK-3 (nucleotide 79-824) or ALK-4 
nucleotide 1178-1967) . Such analyses revealed that a clone 

20 termed ME-7 hybridised with the human ALK-3 probe. 
However, nucleotide sequencing revealed that this clone was 
incomplete, and lacked the 5' part of the translated 
region. Screening the seme cDNA library with a probe 
corresponding to the extracelluar domain of human ALK-3 

25 (nucleotides 79-824) revealed the clone ME-D. This clone 
was isolated and the sequence was analyzed. Although this 
clone was incomplete in the 3' end of the translated 
region, ME-7 and ME-D overlapped and together covered the 
complete sequence of mouse ALK-3 . The predicted amino-acid 

30 sequence of mouse ALK-3 is very similar to the human 
sequence; only 8 amino-acid residues differ (98% identity; 
see SEQ ID No. 14) and the calculated molecular mass of the 
primary translated product without the putative signal 
sequence is 57,447 Da. 

35 Of the clones obtained from the initial library 

screening with PCR product 11.1, four clones hybridized to 
the probe corresponding to the conserved kinase domain of 
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ALK-4 but not to probes from more divergent parts of ALK-1 
to -4. Analysis of these clones revealed that they have an 
identical sequence which differs from those of ALK-1 to -5 
and was termed ALK-6. The longest clone KE6 with a 2.0 kb 
5 insert was completely sequenced yielding a 1952 bp fragment 
consisting of an open reading frame of 1506 bp (502 aminos 
acids), flanked by a 5' untranslated sequence of 186 bp, 
and a 3' untranslated sequence of 160 bp. The nucleotide 
and predicted amino-acid sequences of mouse ALK-6 are shown 

10 in SEQ ID Nos. 17 and 18. No polyadenylation signal was 
found in the 3' untranslated region of ME6, indicating that 
the cDNA was internally primed in the 3' end. Only one ATG 
codon was found in the 5' part of the open reading frame, 
which fulfils the rules for translation initiation (Kozak, 

15 supra ) , and was preceded by an in-f rame stop codon at 
nucleotides 163-165. However, a typical hydrophobic leader 
sequence was not observed at the N terminus of the 
translated region. Since there is no ATG codon and 
putative hydrophobic leader sequence, this ATG codon is 

20 likely to be used as a translation initiation site. The 
calculated molecular mass of the primary translated product 
with the putative signal sequence is 55,576 Da. 

Mouse ALK-1 (clone AM6 with 1.9 kb insert) was 
obtained from the mouse placenta A Z API I cDNA library using 

25 human ALK-1 cDNA as a probe (see SEQ ID No. 11). Mouse 
ALK-4 (clone 8al with 2.3kb insert) was also obtained from 
this library using human ALK-4 cDNA library as a probe (SEQ 
ID No. 15) . 

To summarise, clones HP22, HP57, ONF1, ONF3, 0NF4 and 
30 HP29 encode the same gene, ALK-1. Clone AM6 encodes mouse 
ALK-1. HP53, HP 6 4 and HP84 encode the same gene, ALK-2. 
ONF5, ONF2 and 0N11 encode the same gene ALK-3. ME-7 and 
ME-D encode the mouse counterpart of human ALK-3. 11H8 
encodes a different gene ALK-4, whilst 8al encodes the 
35 mouse equivalent. EKBLA encodes ALK-5, and ME- 6 encodes 
ALK-6. 
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The sequence alignment between the 6 ALK genes end 
TBR-II, mActR-II and ActR-IIB is shown in Figure 3. These 
molecules have a similar domain structure; an N-terminal 
predicted hydrophobic signal .sequence (von Heijne (1986) 
5 Nucl. Acids Res. H: 4683-4690) is followed by a relatively 
small extracellular cysteine-rich ligand binding domain, a 
single hydrophobic transmembrane region (Kyte & Doolittle 
(1982) J. Mol. Biol. 157 . 105-132) and a C-terminal 
intracellular portion, which consists almost entirely of a 

10 kinase domain (Figures 3 and 4). 

The extracelluar domains of these receptors have 
cysteine-rich regions, but they show little sequence 
similarity; for example, less than 20% sequence identity is 
found between Daf -1, ActR-II, TBR-II and ALK-5. The ALKs 

15 appear to form a subfamily as they show higher sequence 
similarities (15-47% identity) in their extracellular 
domains. The extracellular domains of ALK-5 and ALK-4 have 
about 29% sequence identity. In addition, ALK- 3 and ALK-6 
share a high degree of sequence similarity in their 

20 extracellular domains (46% identity). 

The positions of many of the cysteine residues in all 
receptors can be aligned, suggesting that the extracellular 
domains may adopt a similar structural configuration. See 
Figure 5 for ALKs-1,-2,-3 &- 5. Each of the ALKs (except 

25 ALK-6) has a potential N- linked glycosylation site, the 
position of which is conserved between ALK-1 and ALK-2 , and 
between ALK-3, ALK-4 and ALK-5 (see Figure 4). 

The sequence similarities in the kinase domains 
between daf -1. ActR-II, TBR-II and ALK-5 are approximately 

30 40%, whereas the sequence similarity between the ALKs 1 to 
6 is higher (between 59% and 90%; see Figure 6). Pairvise 
comparison using the Ju tun-He in sequence alignment program 
(Hein (1990) Meth, Enzymol., IS2, 626-645), between all 
family members, identifies the ALKs as a separate subclass 

35 among serine/threonine kinases (Figure 7) . 

The catalytic domains of kinases can be divided into 
12 subdomains with stretches of conserved amino-acid 
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residues. The key motifs are found in serine/threonine 
kinase receptors suggesting that they are functional 
kinases. The consensus sequence for the binding of ATP 
(Gly-X-Gly-X-X-Cly in subdomain I followed by a Lys residue 
5 further downstream in subdomain II) is found in all the 
ALKs. 

The kinase domains of flaf-l, ActR-Il, and ALKs show 
approximately equal sequence similarity with tyrosine and 
serine/threonine protein kinases. However analysis of the 

10 amino-acid sequences in subdomains VI and VTII, which are 
the most useful to distinguish a specificity for 
phosphorylation of tyrosine residues versus 
serine/threonine residues (Hanks et al (1988) Science ZA1 
42-52) indicates that these kinases are serine/threonine 

15 kinases; refer to Table 2. 
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SUB DOHA 


TUB 




VIB 


VIII 


Serine/threonine kinase consensus 


DLKPEN 


6 (T/S) XX 
(Y/F) X 


| Tyrosine kinase consensus 


DLAARN 


XP(I/V) 
(K/R) W 
(T/M) 


[ Act R-II 


DIKSKN 


GTRRYM 


Act R-IIB 


DFKSKN 


GTRRYM 


T6R-II 


DLKSSN 


GTARYM 


ALK-I 


DFKSRN 


GTKRYM 


ALK -2, -3, -4, -5, & -6 


DLKSKN 


GTKRYM 



The sequence motifs DLKSKN (Subdomain VIB) and GTKRYM 
(Subdomain VIII) , that are found in most of the 
serine/threonine kinase receptors, agree well with the 

15 consensus sequences for all protean serine/ threonine kinase 
receptors in these regions. In addition, these receptors, 
except for ALK-1, do not have a tyrosine residue surrounded 
by acidic residues between subdomains VII and VIII, which 
is common for tyrosine kinases. A unique characteristic of 

20 the members of the ALK serine /threonine kinase receptor 
family is the presence of two short inserts in the kinase 
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domain between subdomains VIA and VIB and between 
subdomains X and XI. In the intracellular domain, these 
regions, together with the juxtamembrane part and C- 
terminal tail, are the most divergent between family 
5 members (see Figures 3 and 4). Based on the sequence 
similarity with the type II receptors for TGF-B and 
activin, the. C termini of the kinase domains of ALKs -1 to 
-6 are set at Ser-4S5, Ser-501, Ser-527, Gln-500, Gln-498 
and Ser-497, respectively. 

10 mRNA Expression 

The distribution of ALK-1, -2, -3, -4 was determined 
by Northern blot analysis. A Northern blot filter with 
mRNAs from different human tissues was obtained from 
Clontech (Palo Alto, C.A.). The filters were hybridized 

15 with 32 P-labelled probes at 42°C overnight in 50% 
formaldehyde, 5 x standard saline citrate (SSC; lxSSC is 
5CmM sodium citrate, pH 7.0, 150 mM NaCl) , 0.1* SDS, 50 mM 
sodium phosphate, 5 x Denhardt's solution and 0.1 mg/ml 
salmon sperm DNA. In order to minimize cross- 

20 hybridization, probes were used that did not encode part of 
the kinase domains, but corresponded to the highly diverged 
sequences of either 5' untranslated and ligand-binding 
regions (probes for ALK-1, -2 and -3) or 3' untranslated 
sequences (probe for ALK-4). The probes were labelled by 

25 random priming using the Multiprime (or Mega-prime) DNA 
labelling system and [o- 32 P] dCTP (Feinberg & Vogelstein 
(1983) Anal. Biochem. 132: 6-13). Unincorporated label was 
removed by Sephadex G-25 chromatography. Filters were 
washed at 65°C, twice for 30 minutes in 2.5 x SSC, 0.1* SDS 

30 and twice for 30 minutes in 0.3 x SSC, 0.1* SDS before 
being exposed to X-ray film. Stripping of blots was 
performed by incubation at 90-100°C in water for 20 
minutes . 

The ALK-5 mRNA size and distribution were determined 
35 by Northern blot analysis as above. An Eco Rl fragment of 
980bp of the full length ALK-5 cDNA clone, corresponding to 
the C-terminal part of the kinase domain and 3' 
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untranslated region (nucleotides 1259-2232 in SEQ ID No. 9) 
was used as a probe. The filter was washed twice in 0.5 x 
SSC, 0.1% SDS at 55°C for 15 minutes. 

Using the probe for ALK-1 . two transcripts of 2.2 and 
5 4.9k& were detected. The ALK-1 expression level varied 
strongly between different tissues, high in placenta and 
lung, moderate in heart, muscle and kidney, and low (to not 
detectable) in brain, liver and pancreas. The relative 
ratios between the two transcripts were similar in most 
10 tissues; in kidney, however, there was relatively more of 
the 4.9 kb transcript. By reprobing the blot with a probe 
for ALK-2, one transcript of 4.0 kb was detected with a 
ubiquitous expression pattern. Expression was detected in 
every tissue investigated and was highest in placenta and 
skeletal muscle. Subsequently the blot was reprobed for 
ALK-3. One major transcript of 4.4 kb and a minor 
transcript of 7.9 kb were detected. Expression was high in 
skeletal muscle, in which also an additional minor 
transcript of 10 kb was observed. Moderate levels of ALK-3 
mRNA were detected in heart, placenta, kidney and pancreas, 
and low (to not detectable) expression was found in brain, 
lung and liver. The relative ratios between the different 
transcripts were similar in the tested tissues, the 4.4 kb 
transcript being the predominant one, with the exception 
for brain where both transcripts were expressed at a 
similar level. Probing the blot with ALK-4 indicated the 
presence of a transcript with the estimated size of 5.2 kb 
and revealed an ubiquitous expression pattern. The results 
of Northern blot analysis using the probe for ALK-5 showed 
that a 5.5 kb transcript is expressed in all human tissues 
tested, being most abundant in placenta and least abundant 
in brain and heart. 

The distribution of mRNA for mouse ALK-3 and -6 in 
various mouse tissues was also determined by Northern blot 
analysis. A multiple mouse tissue blot was obtained from 
Clontech, Palo Alto, California, U.S.A. The filter was 
hybridized as described above with probes for mouse ALK-3 
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and ALK-6. The EssRI-£s£I restriction fragment , 
corresponding to nucleotides 79-1100 of ALK-3, and the 
Sac l- Hpa l fragment, corresponding to nucleotides 57-720 of 
ALK-6, were used as probes. The filter was washed at 65°C 
5 twice for 30 minutes in 2.5 x SSC, 0.1% SDS and twice for 
30 minutes with 0.3 x SSC, 0*1% SDS and then subjected to 
autoradiography . 

Using the probe for mouse ALK-3, a 1.1 kb transcript 
was found only in spleen. By reprobing the blot with the 

10 ALK-6 specific probe, a transcript of 7.2 kb was found in 
brain and a weak signal was also seen in lung. No other 
signal was seen in the other tissues tested, i.e. heart, 
liver, skeletal muscle, kidney and testis. 

All detected transcript sizes were different, and thus 

15 no cross-reaction between mRNAs for the different ALKs was- 
observed when the specific probes were used. This suggests 
that the multiple transcripts of ALK-1 and ALK-3 are coded 
from the same gene. The mechanism for generation of the 
different transcripts is unknown at present; they may be 

20 formed by alternative mRNA splicing, differential 
polyadenylation, use of different promotors, or by a 
combination of these events. Differences in mRNA splicing 
in the regions coding for the extracellular domains may 
lead to the synthesis of receptors with different 

25 affinities for ligands, as was shown for mActR-IIB 
(Attisano et al (1992) Cell ££, 97-108) or to the 
production of soluble binding protein. 

The above experiments describe the isolation of 
nucleic acid sequences coding for new family of human 

30 receptor kinases. The cDNA for ALK-5 was then used to 
determine the encoded protein size and binding properties. 
Properties of the ALKs cDN A Encoded Proteins 

To study the properties of the proteins encoded by the 
different ALK cDNAs, the cDNA for each ALK was subcloned 

35 into a eukaryotic expression vector and transfected into 
various cell types and then subjected to 
immunoprecipitation using a rabbit antiserum raised against 
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a synthetic peptide corresponding to part of the 
intracellular juxtamembrane region. This region is 
divergent in sequence between the various serine/ threonine 
kinase receptors. The following amino-acid residues were 
. 5 used: 

ALK-1 145-166 
ALK-2 151-172 
ALK-3 181-202 
ALK-4 153-171 

10 ALK-5 158-179 
ALK-6 151-168 

The rabbit antiserum against ALK-5 was designated VPN. 
The peptides were synthesized with an Applied 
Biosystems 4 3 OA Peptide Synthesizer using t-butoxycarbonyl 

15 chemistry and purified by reversed-phase high performance 
liquid chromatography. The peptides were coupled to 
keyhole limpet haemocyanin (Calbiochem-Behring) using 
glutaraldehyde, as described by Guillick &1 (1985) EMBO 
J. A, 2869-2877. The coupled peptides were mixed with 

20 Freunds adjuvant and used to immunize rabbits. 
Transient transfection of the ALK-5 cDNA 

COS-1 cells (American Type Culture Collection) and the 
R mutant of MvlLu cells (for references, see below) were 
cultured in Dulbecco's modified Eagle's medium containing 

25 10% fetal bovine serum (FBS) and 100 units/ml penicillin 
and 50 /xg 1ml streptomycin in 5% C0 2 atmosphere at 37°C. 
The ALK-5 cDNA (nucleotides (-76) - 2232) , which includes 
the complete coding region, was cloned in the pSV7d vector 
(Truett et al, (1985) DNA A, 333-349), and used for 

30 transfection. Transfection into COS-1 cells was performed 
by the calcium phosphate precipitation method (Wigler e£ 
(1979) Cell i£, 777-785). Briefly, cells were seeded into 
6-well cell culture plates at a density of 5xl0 5 
cells/well, and transfected the following day with 10 fig of 

35 recombinant plasmid. After overnight incubation, cells 
were washed three times with a buffer containing 25 mM 
Tris-HCl, pH 7.4, 138 mM NaCl, 5 mM KC1, 0.7 mM CaCl 2 , 0.5 
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mM MgCl 2 and 0.6 mM NajHP0 4 , and then incubated with 
Dulbecco's modified Eagle's medium containing FBS and 
antibiotics/ Two days after transfection, the cells were 
metabolically labelled by incubating the cells for 6 hours 
5. in methionine and cysteine-free MCDB 104 medium with 150 
pCi/ml of [^S] -methionine and [ 35 S] -cysteine (in vivo 
labelling mix; Amersham) . After labelling, the cells were 
washed with 150 mM NaCI, 25 mM Tris-HCl, pH 7.4, and then 
solubilized with a buffer containing 20mM Tris-HCl, pH 7.4, 

10 150 mM NaCI, 10 mM EDTA, 1% Triton X-100, 1% deoxycholate, 
1.5% Trasylol (Bayer) and 1 mM phenylmethylsulf onylf luoride 
(PMSF; Sigma). After 15 minutes on ice, the cell lysates 
were pelleted by centrifugation, and the supernatants were 
then incubated with 7 fil of preimmune serum for 1.5 hours 

15 at 4°C. Samples were then given 50 pi of protein A— 
Sepharose (Pharmacia-LKB) slurry (50% packed beads in 150 
mM NaCI, 20 mM Tris-HCl, pH 7.4, 0.2% Triton X100) and 
incubated for 45 minutes at 4°C. The beads were spun down 
by centrifugation, and the supernatants (1 ml) were then 

20 incubated with either 7 /il of preimmune serum or the VPN 
antiserum for 1.5 hours at 4°C. For blocking, 10 ng of 
peptide was added together with the antiserum. Immune 
complexes were then given 50 til of protein A-Sepharose 
(Pharmacia - LKB) slurry (50% packed beads in 150 mM NaCI, 

25 20mM Tris-HCl, pH 7.4, 0.2% Triton X-100) and incubated for 
45 minutes at 4°C. The beads were spun down and washed 
four times with a washing buffer (20 mM Tris-HCl, pH 7.4, 
500 mM NaCI, 1% Triton X-100, 1% deoxycholate and 0.2% 
SDS) , followed by one wash in distilled water. The immune 

30 complexes were eluted by boiling for 5 minutes in the SDS- 
sample buffer (100 mM Tris-HCl, pH 8.8, 0.01% bromophenol 
blue, 36% glycerol, 4% SDS) in the presence of 10 mM DTT, 
and analyzed by SDS-gel electrophoresis using 7-15% 
polyacrylamide gels (Blobel and Dobberstein, (1975) J. Cell 

35 Biol. 67, 835-851). Gels were fixed, incubated with 
Amplify (Amersham) for 20 minutes, and subjected to 
fluorography. A component of 5 3 Da was seen. This 
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component was not seen when preimmune serum was used, or 
when 10 M9 blocking peptide was added together with the 
antiserum. Moreover, it was not detectable in samples 
derived from untransf ected COS-1 cells using either 
5- preimmune serum or the antiserum. 
picrestion with Endoalycosidase F 

Samples immunoprecipitated with the VPN antisera 
obtained as described above were "incubated with 0*5 U of 
endoglycosidase F (Boehringer Mannheim Biochemica) in a 

10 buffer containing 100 mM sodium phosphate, pH 6.1, 50 mM 
EDTA, 1% Triton X-100, 0.1% SDS and 1% fi-mercaptoethanol at 
37°C for 24 hours. Samples were eluted by boiling for 5 
minutes in the SDS-sample buffer, and analyzed by SDS- 
polyacrylamide gel electrophoresis as described above. 

15 Hydrolysis of N-linked carbohydrates by endoglycosidase F 
shifted the 53 kDa band to 51 XDa. The extracelluar domain 
of ALK-5 contains one potential acceptor site for N- 
glycosylation and the size of the deglycosylated protein is 
close to the predicted size of the core protein. 

20 Establishment of PAE Cell Lines Expressing ALK-5 

In order to investigate whether the ALK-5 cDNA encodes 
a receptor for TGF-B, porcine aortic endothelial (PAE) 
cells were transfected with an expression vector containing 
the ALK-5 cDNA, and analyzed for the binding of 125 I-TGF-B1. 

25 PAE cells were cultured in Ham's F-12 medium 

supplemented with 10% FBS and antibiotics (Miyazono e£ £l« , 
(1988) J. Biol. Chem. 231, 6407-6415). The ALK-5 cDNA was 
cloned into the cytomegalovirus (CMV) -based expression 
vector pcDNA I/NEO (Invitrogen) , and transfected into PAE 

30 cells by electroporation. After 48 hours, selection was 
initiated by adding Geneticin (G418 sulphate; Gibco • BRL) 
to the culture medium at a final concentration of 0.5 mg/ml 
(Westermark si. , (1990) Proc. Natl. Acad. Sci. USA £2, 
128-132) . Several clones were obtained, and after analysis 

35 by immunoprecipitation using the VPN antiserum, one clone 
denoted PAE/TBR-1 was chosen and further analyzed. 
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Todination of TGF-fll, Binding and Affinity ftrosslinkincr 

Recombinant human TGF-B1 was iodinated using the 
chloramine T method according to Frolik &1* , (1984) J. 
Biol. Chem. 252/ 10995-11000. Cross-linking experiments 
5 were performed as previously described (Ichijo <g£ al. r 
(1990) Exp. Cell Res. IfiZ, 263-269). Briefly, cells in 6- 
vell plates were washed with binding buffer (phosphate- 
buffered saline containing 0.9 mM CaCl 2 , 0.49 nH MgCl 2 and 
1 mg/ml bovine serum albumin (BSA) ) , and incubated on ice 

10 in the same buffer with I-TGF-Bl in the presence or 
absence of excess unlabelled TGF-B1 for 3 hours. Cells 
were washed and cross-linking was done in the binding 
buffer without BSA together with 0.28 mM disuccinimidyl 
suberate (DSS; Pierce Chemical Co.) for 15 minutes on ice. 

15 The cells were harvested by the addition of 1 ml of., 
detachment buffer (10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 10% 
glycerol, 0.3 mM PMSF) . The cells were pelleted by 
centrifugation, then resuspended in 50 /il of solubilization 
buffer (125 mM NaCl, 10 mM Tris-HCl, pH 7.4, 1 mM EDTA, 1% 

20 Triton X-100, 0.3 mM PMSF, 1% Trasylol) and incubated for 
40 minutes on ice. Cells were centrifuged again and 
supernatants were subjected to analysis by SDS-gel 
electrophoresis using 4-15% polyacrylamide gels, followed 
by autoradiography. 125 I-TGF-B1 formed a 70 kDa cross- 

25 linked complex in the transfected PAE cells (PAE/TBR-I 
cells) . The size of this complex was very similar to that 
of the TGF-B type I receptor complex observed at lower 
amounts in the untransf ected cells. A concomitant increase 
of 94 kDa TGF-B type II receptor complex could also be 

30 observed in the PAE/TBR-I cells. Components of 150-190 
kDa, which may represent crosslinked complexes between the 
type I and type II receptors, were also observed in the 
PAE/TBR-I cells. 

In order to determine whether the cross-linked 70 kDa 

35 complex contained the protein encoded by the ALK-5 cDNA, 
the affinity cross-linking was followed by 
immunoprecipitation using the VPN antiserum. For this, 
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cells in 25 cm 2 flasks were used. The supernatants 
obtained after cross-linking were incubated with 7 /*1 of 
pre immune serum or VPN antiserum in the presence or absence 
of 10 M9 of peptide for 1.5h at 4°C. Immune complexes were 
5 then added to 50 /xl of protein A-Sepharose slurry and 
incubated for 45 minutes at 4°C. The protein A-Sepharose 
beads were washed four times with the washing buffer, once 
with distilled water, and the samples were analyzed by SDS- 
gel electrophoresis using 4-15% polyacrylamide gradient 

10 gels and autoradiography. A 70 kDa cross-linked complex 
was precipitated by the VPN antiserum in PAE/TBR-1 cells, 
and a weaker band of the same size was also seen in the 
untransfected cells, indicating that the untransf ected PAE 
cells contained a low amount of endogenous ALK-5. The 70 

15 kDa complex was not observed when pre immune serum was used, 
or when immune serum was blocked by 10 jig of peptide. 
Moreover, a coprecipitated 94 kDa component could also be 
observed in the PAE/TBR-I cells. The latter component is 
likely to represent a TGF-6 type II receptor complex, since 

20 an antiserum, termed DRL, which was raised against a 
synthetic peptide from the C-terminal part of the TGF-B 
type II receptor, precipitated a 94 kDa TGF-B type II 
receptor complex, as well as a 70 kDa type I receptor 
complex from PAE/TBR-I cells. 

25 The carbohydrate contents of ALK-5 and the TGF-B type 

II receptor were characterized by deglycosylation using 
endoglycosidase F as described above and analyzed by SDS- 
polyacrylamide gel electrophoresis and autoradiography. 
The ALK-5 cross-linked complex shifted from 70 kDa to 66 

30 kDa, whereas that of the type II receptor shifted from 94 
kDa to 82 kDa. The observed larger shift of the type II 
receptor band compared with that of the ALK-5 band is 
consistent with the deglycosylation data of the type I and 
type II receptors on rat liver cells reported previously 

35 (Cheifetz e£ al (1988) J. Biol. Chem. 2£2, 16984-16991), 
and fits well with the fact that the porcine TGF-B type II 
receptor has two N-glycosylation sites (Lin e£ al (1992) 
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Cell IS., 775-785) , whereas ALK-5 has only one (see SEQ ID 
No. 9). 

Binding of TGF-B1 to the type I receptor is known to 
be abolished by- transient treatment of the cells with 
5 dithiothreitol (DTT) (Cheifetz and Massague (1991) J. Biol. 
Chem. 2££, 20767-20772; Wrana al (1992) Cell 21, 1003- 
1014) . When analyzed by affinity cross-linking, binding of 
125 I-TGF-B1 to ALK-5, but not to the type II receptor, was 
completely abolished by DTT treatment of PAE/TBR-1 cells. 

10 Affinity cross-linking followed by immunoprecipitation by 
the VPN antiserum showed that neither the ALK-5 nor the 
type II receptor complexes was precipitated after DTT 
treatment, indicating that the VPN antiserum reacts only 
with ALK-5. The data show that the VPN antiserum 

15 recognizes a TGF-B type I receptor, and that the type I and 
type II receptors form a heteromeric complex. 
125 I-TGF-B1 Binding & Affinity Crosslinking of Transfected 
COS Cells 

Transient expression plasmids of ALKs -1 to -6 and 

20 TBR-II were generated by subcloning into the pSV7d 
expression vector or into the pcDNA I expression vector 
(Invitrogen) . Transient transfection of COS-1 cells and 
iodination of TGF-B 1 were carried out as described above* 
Crosslinking and immunoprecipitation were performed as 

25 described for PAE cells above. 

Transfection of cDNAs for ALKs into COS-1 cells did 
not show any appreciable binding of 125 I-TGFB1, consistent 
with the observation that type I receptors do not bind TGF- 
B in the absence of type II receptors. When the TBR-II 

30 cDNA was co-trans fected with cDNAs for the different ALKs, 
type I receptor-like complexes were seen, at different 
levels, in each case. COS-1 cells transfected with TBR-II 
and ALK cDNAs were analyzed by affinity crosslinking 
followed by immunoprecipitation using the DRL antisera or 

35 specific antisera against ALKs. Each one of the ALKs bound 
125 I-TGF-fll and was coimmunoprecipitated with the TBR-II 
complex using the DRL antiserum. Comparison of the 
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efficiency of the different ALKs to torn heteromeric 
complexes with TBR-II, revealed that ALK-5 formed such 
complexes more efficiently than the other ALKs. The size 
of the cross linked complex was larger for ALK- 3 than for 
5 other ALKs, consistent with its slightly larger size. 
Expression of the ALK Protein in Diffe rent Cell Types 

Two different approaches were used to elucidate which 
ALK's are physiological type I receptors for TGF-fl. 

Firstly, several cell lines were tested for the 

10 expression of the ALK proteins by cross-linking followed by 
immunoprecipitation using the specific antiseras against 
ALKs and the TGF-B type II receptor. The mink lung 
epithelial cell line, MvlLu, is widely used to provide 
target cells for TGF-B action and is well characterized 

15 regarding TGF-B receptors (Laiho et aj, (1990) J. Biol. 
Chem. 265 . 18518-18524; Laiho et aJL (1991) J. Biol. Chem. 
266 . 9108-9112) . Only the VPN antiserum efficiently 
precipitated both type I and type II TGF-B receptors in the 
wild type MvlLu cells. The DRL antiserum also precipitated 

20 components with the same size as those precipitated by the 
VPN antiserum. A mutant cell line (R mutant) which lacks 
the TGF-B type I receptor and does not respond to TGF-B 
(Laiho et al, supra ) was also investigated by cross-linking 
followed by immunoprecipitation. Consistent with the 

25 results obtained by Laiho et al (1990) , supra the type III 
and type II TGF-B receptor complexes, but not the type I 
receptor complex, were observed by affinity crosslinking. 
Crosslinking followed by immunoprecipatition using the DRL 
antiserum revealed only the type II receptor complex, 

30 whereas neither the type I nor type II receptor complexes 
was seen using the VPN antiserum. When the cells were 
metabolically labelled and subjected to immunoprecipitation 
using the VPN antiserum, the 53 kDa ALK-5 protein was 
precipitated in both the wild-type and R mutant MvlLu 

35 cells. These results suggest that the type I receptor 
expressed in the R mutant is ALK-5, which has lost the 
affinity for binding to TGF-B after mutation. 
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The type I and type IX TGF-5 receptor complexes could 
be precipitated by the VPN and DRL antisera in other cell 
lines, including human foreskin fibroblasts (AG1518) , human 
lung adenocarcinoma cells (A549), and human oral squamous 
5 cell carcinoma cells (HSC-2). Affinity cross-linking 
studies revealed multiple TGF-B type I receptor-like 
complexes of 70-77 kDa in these cells* These components 
were less efficiently competed by excess unlabelled TGF-B 1 
in HSC-2 cells. Moreover, the type II receptor complex was 

10 low or not detectable in A549 and HSC-2 cells. Cross- 
linking followed by immunoprecipitation revealed that the 
VPN antiserum precipitated only the 70 kDa complex among 
the 70-77 kDa components. The DRL antiserum precipitated 
the 94 kDa type II receptor complex as veil as the 70 kDa 

15 type I receptor complex in these cells, but not the 
putative type I receptor complexes of slightly larger 
sizes. These results suggest that multiple type I TGF-B 
receptors may exist and that the 70 kDa complex containing 
ALK-5 forms a heteromeric complex with the TGF-B type II 

20 receptor cloned by Lin et (1992) Cell ££/ 775-785, more 
efficiently that the other species. In rat 

pheochromocytoma cells (PC12) which have been reported to 
have no TGF-B receptor complexes by affinity cross-linking 
(Massagu6 et al (1990) Ann. N.Y. Acad. Sci. 593 . 59-72), 

25 neither VPN nor DRL antisera precipitated the TGF-B 
receptor complexes. The antisera against ALKs -1 to -4 and 
ALK6 did not efficiently immunoprecipitate the cross linked 
receptor complexes in porcine aortic endothelial (PAE) 
cells or human foreskin fibroblasts. 

30 Next, it was investigated whether ALKs could restore 

responsiveness to TGF-6 in the R mutant of MvlLu cells, 
which lack the ligand-binding ability of the TGF-B type I 
receptor but have intact type II receptor. Wild-type MvlLu 
cells and mutant cells were rransfected with ALK cDNA and 

35 were then assayed for the production of plasminogen 
activator inhibitor-1 (PAI-1) which is produced as a result 
of TGF-B receptor activation as described previously by 



WO 94/USU2 



KCi/OByj/UZ«M>/ 



32 

Laiho fit al (1991) Mol. Cell Biol. H, 972-978. Briefly, 
cells were added with or without 10 ng/ml of TGF-fil for 2 
hours in serum-free MCDB 104 without methionine. 
Thereafter, cultures were labelled with [ n S] methionine (40 
5 pCt /ml) for 2 hours. The cells were removed by washing on 
ice once in PBS, twice in 10 mM Tris-HCl (pH 8.0), 0.5% 
sodium deoxy chelate, 1 mM PMSF, twice in 2 mM Tris-HCl (pH 
8.0), and once in PBS. Extracellular matrix proteins were 
extracted by scraping cells into the SDS-sample buffer 

10 containing DTT, and analyzed by SDS-gel electrophoresis 
followed by fluorography using Amplify. PAI-1 can be 
identified as a characteristic 45kDa band (Laiho efc 
(1991) Mol. Cell Biol. H, 972-978). Wild-type MvlLu cells 
responded to TGF-B and produced PAI-1, whereas the R mutant 

15 clone did not, even after stimulation by TGF-B1. Transient 
transf ection of the ALK-5 cDNA into the R mutant clone led 
to the production of PAI-1 in response to the stimulation 
by TGF-B1, indicating that the ALK-5 cDNA encodes a 
functional TGF-B type I receptor. In contrast, the R 

20 mutant cells that were transfected with other ALKs did not 
produce PAI-1 upon the addition of TGF-B 1. 

Using similar approaches as those described above for 
the identification of TGF-B-binding ALKs, the ability of 
ALKs to bind activin in the presence of ActRII was 

25 examined. COS-1 cells were co-transf ected as described 
above. Recombinant human activin A was iodinated using the 
chloramine T method (Mathews and Vale (1991) Cell ££, 973- 
982). Transfected C0S-1 cells were analysed for binding 
and crosslinking of 125 I-activin A in the presence or 

30 absence of excess unlabelled activin A. The crosslinked 
complexes were subjected to immunoprecipitation using DRL 
antisera or specific ALK antisera. 

All ALKs appear tc bind activin A in the presence of 
Act R-II. This is more clearly demonstrated by affinity 

35 cross-linking followed by immunopreciptation. ALK-2 and 

125 

ALK- 4 bound I-activin A and were coimmunoprecipitated 
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with ActR-II. Other ALKs also bound 12S I-activin A but with 
a lower efficiency compared to ALK-2 and ALK-4. 

Zn order to investigate whether ALKs are physiological 
activin type I receptors, activin responsive cells were 
5 examined for the expression of endogenous activin type I 
receptors. MvlLu cells,, as well as the R mutant, express 
both type I and type II receptors for activin, and the R 
mutant cells produce PAI-1 upon the addition of activin A. 
MvlLu cells were labeled with I-activin A, cross-linked 

10 and immunoprecipitated by the antisera against ActR-II or 
ALKs as described above* 

The type I and type II receptor complexes in MvlLu 
cells were immunoprecipitated only by the antisera against 
ALK-2, ALK-4 and ActR-II. Similar results were obtained 

15 using the R mutant cells. PAE cells do not bind activin 
because of the lack of type II receptors for activin, and 
so cells were transfected with a chimeric receptor , to 
enable them to bind activin, as described herein. A 
plasmid (chim A) containing the extracelluar domain and C- 

20 terminal tail of Act R-II (amino-acids -19 to 116 and 465 
to 494, respectively (Mathews and Vale (1991) Cell, £1, 
973-982)) and the kinase domain of TBR-II (amino-acids 160- 
543) (Lin e£ al (1992) Cell, £8, 775-785) was constructed 
and transfected into pcDNA/neo (Invitrogen) . PAE cells 

25 were stably transfected with the chim A plasmid by 
electroporation, and cells expressing the chim A protein 
were established as described previously. PAE/Chim A cells 
were then subjected to 125 I-activin A labelling crosslinking 
and immunoprecipitation as described above. 

30 Similar to MvlLu cells, activin type I receptor 

complexes in PAE/Chim A cells were immunoprecipitated by 
the ALK-2 and ALK-4 antisera. These results show that both 
ALK-2 and ALK-4 serve as high affinity type I receptors for 
activin A in these cells. 

35 ALK-1, ALK-3 and ALK-6 bind TGF-Bl and activin A in 

the presence of their respective type II receptors, but the 
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functional consequences of the binding of the ligands 
remains to be elucidated* 

The invention has been described by vay of example 
only, without restriction of its scope. The invention is 
5 defined by the subject matter herein, including the claims 
that follow the immediately following full Sequence 
Listings. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANTS 

(A) NAME: Ludwig Institute for Cancer Research 

(B) STREET: St. Mary's Hospital Medical School, Norfolk 

Place 

(C) CITY: Paddington, London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): W2 IPG 

(ii) TITLE OF INVENTION: PROTEINS HAVING SERINE /THREONINE KINASE 
DOMAINS, CORRESPONDING NUCLEIC ACID MOLECULES, AND THEIR 
USE 

(iii) NUMBER OF SEQUENCES $ 29 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/M?-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 283.. 1791 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGCAG GCTCCCCCCC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 



CCAGCGCTGG CGGTCCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCOGA 



240 



WO 94/11502 



PCT/GB93/02367 



36 



390 



438 



486 



AGCCTAGCCC CCCGCCACCC GCAGAGCCCC CCCAGAGGGA CC ATC ACC TTC GGC 294 

1 

TCC CCC ACC AAA CCC CTT CTC ATC CTC CTC ATC CCC TTC CTC ACC CAG 342 
ser Pro Arg Lye Cly Leu Leu Met L«u Leu Met Ala Leu Val Thr Gin 
5 . 10 15 20 

OCA CAC CCT CTC AAO CCC TCT CCC CCC CCC CTC CTC ACC TCC ACC TOT 
Cly Asp Pro Val Lye Pro Ser Arg Cly Pro Leu Val Thr Cya Thr Cya 
* 25 30 35 

CAC ACC CCA CAT TCC AAC CCC CCT ACC TCC CCC CCC CCC TCC TCC ACA 
Clu Ser Pro Hla Cya Lye Cly Pro Thr Cya Arg Cly Ala Trp Cya Thr 
40 « 50 

GTA CTC CTC CTC CCC CAC CAC CCC AGC CAC CCC CAG GAA CAT CCC CCC 

val val Leu Val Arg Glu Glu Cly Arg Bia Pro Cln Clu Hia Arg Giy 
55 60 65 

TCC CCC AAC TTC CAC ACC CAG CTC TCC ACC CCC CCC CCC ACC CAC TTC 534 
Cya Cly Aen Leu Hie Arg Clu Leu Cya Arg Gly Arg Pro Thr Glu Phe 
70 75 80 

CTC AAC CAC TAC TCC TCC CAC ACC CAC CTC TCC AAC CAC AAC CTC TCC 
val Aen Hia Tyr Cye Cye Asp Ser Hie Leu Cya Aan Hia Aan Val Ser 
85 90 95 100 

CTC CTC CTC CAG CCC ACC CAA CCT CCT TCC CAG CAC CCC CCA ACA CAT 
Leu Val Leu Clu Ala Thr Cln Pro Pro Ser Clu Gin Pro Cly Thr Aap 
105 110 115 

CCC CAC CTC CCC CTC ATC CTC CCC CCC CTC CTC CCC TTC CTC CCC CTC 678 
Cly Cln Leu Ala Leu lie Leu Cly Pro Val Leu Ala Leu Leu Ala Leu 
120 125 130 

CTC CCC CTC CCT CTC CTC CCC CTC TCC CAT CTC CCA CCC ACC CAC CAG 726 
Val Ala Leu Cly Val Leu Cly Leu Trp Hie Val Arg Arg Arg Cln Glu 
135 1*0 145 

AAC CAC CCT CCC CTC CAC ACC GAG CTC GCA CAG TCC ACT CTC ATC CTC 774 
Lve Gin Aro Cly Leu Hia Ser Glu Leu Gly Clu Ser Ser Leu lie Leu 
150 155 160 

AAA CCA TCT CAC CAG CCC CAC ACC ATC TTC CCC CAC CTC CTC CAC ACT 
Lve Ala Ser Clu Cln Cly Aep Thr Met Leu Cly Asp Leu Leu Aep Ser 
1 170 175 180 



582 



630 



822 



870 



165 

CAC TGC ACC ACA GOG AGT GGC TCA GCG CTC CCC TTC CTC CTG CAG AGG 
Aep Cye Thr Thr Cly Ser Cly Ser Gly Leu Pro Phe Leu Val Gin Arg 
185 190 195 

ACA CTG GCA CCC CAC CTT CCC TTC CTG CAC TCT CTC CCA AAA GGC CCC 918 
Thr Val Ala Arg Gin Val Ala Leu Val Glu Cya Val Gly Lye Cly Arg 
200 205 210 

TAT GGC CAA CTG TGC CGG GGC TTC TGC CAC CGT CAC AGT CTG GCC CTC 966 
Tyr Gly Glu Val Trp Arg Gly Leu Trp Hie Gly Glu Ser Val Ala Val 
* 215 220 225 
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AAG ATC TTC TCC TCG AGO GAT GAA CAG TCC TGO TTC CGG GAG ACT GAO 1014 
Lys lie Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe Arg Glu Thr Glu 
230 235 240 

ATC TAT AAC ACA GTA^TTG CTC AGA CAC GAC AAC ATC CTA GGC TTC ATC 1062 
lie Tyr Asn Thr Val Leu Leu Arg Hie Asp Asn lie Leu Gly Phe He 
245 250 255 260 

GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGO CTC ATC 1110 
Ale Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin Leu Trp Leu He 
265 270 275 

ACG CAC TAC CAC GAG CAC GGC TCC CTC TAC GAC TTT CTG CAG AGA CAG 1158 
Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gin Arg Gin 
280 285 290 

ACG CTG GAG CCC CAT CTG OCT CTG AGG CTA GCT CTG TCC GCG GCA TGC 1206 
Thr Leu Glu Pro His Leu Ale Leu Arg Leu Ale Vel Ser Ale Ale Cys 
295 300 305 

GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254 
Gly Leu Ala His Leu Hie Val Glu He Phe Gly Thr Gin Gly Lye Pro 
310 315 320 

GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302 
Ala He Ala His Arg Asp Phe Lye Ser Arg Asn Val Leu Val Lys Ser 
325 330 335 340 

AAC CTG CAG TGT TGC ATC GCC GAC CTG GCC CTG GCT CTG ATG CAC TCA 1350 
Asn Leu Gin Cys Cys He Ala Asp Leu Gly Leu Ala Val Met His Ser 
345 350 355 

CAG GGC AGC GAT TAC CTG GAC ATC GGC AAC AAC COG AGA CTG GGC ACC 1398 
Gin Gly Ser Asp Tyr Leu Asp He Gly Asn Asn Pro Arg Val Gly Thr 
360 365 370 

AAG CGG TAC ATG CCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG CAC 1446 
Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin He Arg Thr Asp 
375 380 385 

TCC TTT GAG TCC TAC AAG TGC ACT CAC ATC TGG GCC TTT GGC CTG GTG 1494. 
Cys Phe Glu Ser Tyr Lys Trp Thr Asp He Trp Ala Phe Gly Leu Val 
390 395 400 

CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542 
Leu Trp Glu He Ala Arg Arg Thr He Val Asn Gly He Val Glu Asp 
405 410 415 420 

TAT AGA CCA CCC TTC TAT GAT CTG CTG CCC AAT CAC CCC AGC TTT CAG 1590 
Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 
425 430 435 

GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638 
Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr Pro Thr He Pro 
440 445 450 

AAC CGG CTG GCT CCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686 
Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gin Met Met 
455 460 465 
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COG GAG TCC TGC TAC CCA AAC CCC TCT 6CC C6X CTC ACC CCO CIC 06C 1734 
Arg Clu Cye Trp Tyr Pro Aen Pro Ser Ala Arg Leu Thr Ala Leu Arg 
470 475 480 

ATC AAG AAG ACA CTA CAA AAA ATT AGC AAC ACT CCA, GAG AAG CCT AAA 1782 
lie Lye Lye Thr Leu Gin Lye He Ser Aan Ser Pro Glu Lye Pro Lye 
485 490 495 500 

CTG ATT CAA TAGCCCAGCA GCACCTGATT CCTTTCTGCC TCCAGCGCCC 1831 
Val He Gin 

TGGGGGGGTG GGGGGCAGTG CATGCTGCCC TATCTCGGTA GAGGTAGTGT CAGTGTGGTG 1891 

TGTGCTGGCG ATGGGCAGCT CCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 

ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA I 984 

(2) INFORMATION FOR SEQ ID NO J 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Leu Cly Ser Pro Arg Lye Gly Leu Leu Met Leu Leu Met Ala 
15 10 15 

Leu Val Thr Gin Gly Aep Pro Val Lye Pro Ser Arg Gly Pro Leu Val 
20 25 30 

Thr Cys Thr Cye Glu Ser Pro Hie Cye Lye Gly Pro Thr Cye Arg Gly 
35 40 45 

Ala Trp Cye Thr Val Val Leu Val Arg Glu Glu Gly Arg Hie Pro Gin 
50 55 60 

Glu Hie Arg Gly Cye Gly Aen Leu Hie Arg Glu Leu Cye Arg Gly Arg 
65 70 75 80 

Pro Thr Glu Phe Val Aen Hie Tyr Cye Cys Aep Ser Hie Leu Cye Aen 
85 90 95 

His Asn Val Ser Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin 
100 105 HO 

Pro Gly Thr Aep Gly Gin Leu Ala Leu He Leu Cly Pro Val Leu Ala 
115 120 125 

Leu Leu Ala Leu Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg 
130 135 140 

Arg Arg Gin Glu Lye Gin Arg Gly Leu Hie Ser Glu Leu Gly Glu Ser 
14S 150 155 160 
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Ser Leu He Leu Lys Ala Ser Clu Gin Gly Asp Thr Met Leu Gly Asp 
165 170 175 

Leu Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe 
180 185 190 

Leu Val Gin Arg Thr Val Ale Arg Gin Val Ala Leu Val Glu Cys Val 

* 195 200 ^ 205 

Gly Lya Gly Arg Tyr Gly Clu Val Trp Arg Gly Leu Trp His Gly Glu 
210 215 220 

Ser Val Ala Val Lys He Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe 
225 230 235 240 

Arg Glu Thr Glu He Tyr Asn Thr Val Leu Leu Arg His Asp Asn lie 
245 250 255 

Leu Gly Phe He Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin 
260 265 270 

Leu Trp Leu He Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe 
275 280 285 

Leu Gin Arg Gin Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val 
290 295 300 

Ser Ala Ala Cys Gly Leu Ala His Leu His Val Glu He Phe Gly Thr 
305 310 315 320 

Gin Gly Lys Pro Ala He Ala His Arg Asp Phe Lys Ser Arg Asn Val 
325 330 335 

Leu Val Lys Ser Asn Leu Gin Cys Cys He Ala Asp Leu Gly Leu Ala 
340 345 350 

Val Met His Ser Gin Gly Ser Asp Tyr Leu Asp He Gly Asn Asn Pro 
355 360 365 

Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin 
370 375 380 

He Arg Thr Asp Cys Phe Clu Ser Tyr Lys Trp Thr Asp He Trp Ala 
385 390 395 400 

Phe Gly Leu Val Leu Trp Glu He Ala Arg Arg Thr He Val Asn Gly 
405 410 415 

He Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp 
420 425 430 

Pro Ser Phe Glu Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr 
435 440 445 

Pro Thr He Pro Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu 
450 455 460 

Ala Gin Met Met Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu 
465 470 475 480 
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Thr Ala Leu Arg lie Lye Lye Thr Leu ©In Lye Xl« Ser Aen Ser Pro 
485 490 495 

Clu Lye Pro Lye Val lie Gin 

500 

(2) INFORMATION FOR SSQ ID NO: 3: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2724 baee paire 

(B) TYPE: nucleic acid 

(C) STKANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo eapiene 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104. .1630 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CTCCGAGTAC CCCAGTGACC AGAGTCAGAG AAGCTCTGAA CGAGGGCACG CGGCTTGAAG 60 

GACTGTGGGC AGATGTGACC AAGAGCCTGC ATTAAGTTGT ACA ATG GTA GAT GGA 115 

Met Val Aep Gly 
1 

GTG ATG ATT CTT CCT CTG CTT ATC ATG ATT GCT CTC CCC TCC CCT AGT 163 
Val Met lie Leu Pro Val Leu lie Met He Ala Leu Pro Ser Pro Ser 
5 10 15 20 

ATG GAA GAT GAG AAG CCC AAG GTC AAC CCC AAA CTC TAC ATC TCT GTG 211 
Met Glu Asp Glu Lys Pro Lye Val Aen Pro Lye Leu Tyr Met Cye Val 
25 30 35 

TGT GAA GGT CTC TCC TGC GGT AAT GAG GAC CAC TGT GAA GGC CAG CAG 259 
Cye Glu Gly Leu Ser Cye Gly Aen Glu Aep Hie Cye Clu Gly Gin Gin 
1 40 45 50 

TGC TTT TCC TCA CTG AGC ATC AAC GAT GGC TTC CAC GTC TAC CAG AAA 307 
Cye Phe Ser Ser Leu Ser He Aen Aep Gly Pho Hie Val Tyr Gin Lye 
55 60 65 

GGC TCC TTC CAG GTT TAT GAG CAG GGA AAG ATG ACC TGT AAG ACC CCG 355 
Gly Cye Phe Gin Val Tyr Glu Gin Gly Lye Met Thr Cye Lye Thr Pro 
70 75 80 
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CCC TCC CCT GGC CAA OCT CTC CAG TGC TCC CAA CCC GAC TOO TGT AAC 403 
Pro Ser Pro Gly Gin Ala Val Glu Cya Cys Gin Gly Asp Trp Cys Aan 
85 90 95 100 

AGG AAC ATC ACG GCC CAG CTC CCC ACT AAA GGA AAA TCC TTC CCT GGA 451 
Arg Aan I la Thr Ala Gin Leu Pro Thr Lys Gly Lya Sar Pha Pro Gly 
105 110 115 

ACA CAG AAT TTC CAC TTG GAG CTT CGC CTC ATT ATT CTC TCT CTA CTG 499 
Thr Gin Aan Pha Ilia Leu Glu Val Gly Leu Zla Zla Lau Sar Val Val 
120 125 130 

TTC GCA GTA TGT CTT TTA CCC TCC CTG CTC GGA GTT CCT CTC CGA AAA 547 
Pha Ala Val Cya Leu Leu Ala Cya Leu Leu Gly Val Ala Leu Arg Lya 
135 140 145 

TTT AAA AGG CGC AAC CAA GAA CGC CTC AAT CCC CGA GAC CTC GAG TAT 595 
Phe Lya Arg Arg Aan Gin Glu Arg Leu Aan Pro Arg Asp Val Glu Tyr 
150 155 160 

CCC ACT ATC CAA GGG CTC ATC ACC ACC AAT CTT CGA CAC ACC ACT TTA 643 
Gly Thr lie Glu Gly Leu Zle Thr Thr Aan Val Gly Asp Ser Thr Leu 
165 170 175 180 

CCA GAT TTA TTG CAT CAT TCC TCT ACA TCA CCA ACT CCC TCT CGT CTT 691 
Ala Asp Leu Leu Asp His Ser Cys Thr Ser Gly Ser Gly Ser Gly Leu 
185 190 195 

CCT TTT CTC CTA CAA ACA ACA GTC CCT CCC CAG ATT ACA CTC TTG GAC 739 
Pro Phe Leu Val Gin Arg Thr Val Ala Arg Gin Zle Thr Leu Leu Glu 
200 205 210 

TGT CTC CCC AAA CCC AGG TAT CCT CAG CTC TGC ACG GCC ACC TCG CAA 787 
Cys Val Gly Lys Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Gin 
215 220 225 

GGG GAA AAT CTT CCC CTG AAG ATC TTC TCC TCC CGT GAT GAG AAG TCA 835 
Gly Glu Ann Val Ala Val Lys Zle Phe Ser Ser Arg Asp Glu Lya Ser 
230 235 240 

TGC TTC ACC CAA ACG GAA TTG TAC AAC ACT CTG ATC CTG ACG CAT GAA 883 
Trp Phe Arg Glu Thr Glu Leu Tyr Aan Thr Val Met Leu Arg His Glu 
245 250 255 260 

AAT ATC TTA GCT TTC ATT CCT TCA CAC ATC ACA TCA ACA CAC TCC ACT 931 
Asn lie Leu Gly Phe Zle Ala Ser Asp Met Thr Ser Arg His Ser Ser 
265 270 275 

ACC CAG CTG TCC TTA ATT ACA CAT TAT CAT CAA ATC CGA TCC TTC TAC 979 
Thr Gin Leu Trp Leu Zle Thr Bis Tyr His Glu Mat Gly Sar Leu Tyr 
280 285 290 

GAC TAT CTT CAG CTT ACT ACT CTG CAT ACA CTT ACC TCC CTT CCA ATA 1027 
Asp Tyr Leu Gin Leu Thr Thr Leu Asp Thr Val Ser Cya Leu Arg Zle 
295 300 305 

CTC CTC TCC ATA CCT ACT CGT CTT CCA CAT TTG CAC ATA GAG ATA TTT 1075 
Val Leu Ser Zle Ala Ser Gly Leu Ala Hia Leu Hia Zla Glu Zla Phe 
310 315 320 
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cce xec CAA CCC AAA CCX CCC ATT CCC CAT CCA CAT TTA AAC ACC AAA 1123 
cty i£ o£ SJ ijl fS. Al. U. AU Hi. Ar, A.p L.u Ly. S.r Ly. 
325 330 335 

AAT ATT CTC CTT AAC AAC AAT CCA CAC TCT TCC ATA CCA CAT TTC CCC 1171 
£1 SI £u 5" E. Ly. A.n Cly Cln Cy. Cy. U. Al. A.p Leu Cly 
345 350 

CTC CCA CTC ATC CAT TCC CAC ACC ACC AAT CAC CTT CAT CTG CCC AAC 1219 
SS aS 52 Sit Si ier Cin ser Thr A.n Cln Leu A.p Val Cly A.n 
360 365 3 

AAT CCC CCT CTC CCC ACC AAC CCC TAC ATC CCC CCC CAA CTT CTA CAT 1267 
?rt 5.1 cij Thr Xy. Arg Tyr M.t Ala Pro clu Val L.u A.p 
375 380 385 

cxx ACC ATC CAC CTC CAT TCT TTC CAT TCT TAT AAA ACC CTC CAT ATT 1315 
3S iS 51? 1.1 Cy. Ph. A. P S.r Tyr Ly. Arg Val A.p II. 
390 3 « 5 400 



TCC CCC TTT CCA CTT CTT TTC TCC CAA CTC CCC ACC CCC ATC CTC ACC 1363 
Tr? aS Oly Zeu Val Leu Trp Clu Val Ala Arg Arg Met Val S.r 
405 «10 415 

AAT CCT ATA CTC CAC CAT TAC AAC CCA CCC TTC TAC CAT CTC CTT CCC 1411 
JSS el? II. vil Clu A.p Tyr Ly. Pro Pro Ph. Tyr A.p Val Val Pro 
425 430 *35 

AAT CAC CCA ACT TTT CAA CAT ATC ACC AAC CTA CTC TCT CTC CAT CAA 1459 
lin Asp Pro S.r Ph. Clu A.p M.t Arg Ly. Val Val Cy. Val A.p Cln 
440 445 450 

CAA ACC CCA AAC ATA CCC AAC ACA TCC TTC TCA CAC CCC ACA TTA ACC 1507 
Cln Arg Pro A«n II. Pro A.n Arg Trp Ph. S.r A.p Pro Thr Leu Thr 
455 460 465 

TCT CTC CCC AAC CTA ATC AAA CAA TCC TCC TAT CAA AAT CCA TCC CCA 1555 
Ser Leu Ala Ly. Leu Met Ly. clu Cy. Trp Tyr Cln Asn Pro Ser Ala 
470 475 480 

ACA CTC ACA CCA CTC CCT ATC AAA AAC ACT TTG ACC AAA ATT CAT AAT 1603 
j£g* iS All Arg lie Ly. Ly. Thr Leu Thr Ly. lie A.p A.n 
485 490 495 soo 

TCC CTC CAC AAA TTC AAA ACT CAC TCT TGACATTTTC ATACTCTCAA 1650 
Ser Leu Asp Lyi Leu Ly. Thr A«p Cy. 
505 

CAACCAACAT TTCACCTTCT TCTCATTCTC CAGCTCCCAC CTAATCCTCC CCTCACTCCT 1710 

TGTCACAATC CAATCCATCT CTCTCCCTCC CCAAATCCCT CCTTTCACAA CCCACACCTC 1770 

CTACCCACCC ATCTCTTCCC CACACATCAA AACCACCCTA ACCTCCCTCC ATCACTCTCA 1830 

ACTGCCCATT TCACCAACTC TTCACACTCC AGACACTAAT GTTCCACAGA CACTCTTCCA 1890 

AACCTACCCA CTCCACCAAC ACAGACAAAT CCTAAAAGAC ATCTCCCCAT TAACTCACTC 1950 

CCTTTCCATA CCTTTCACAA CTCTCCTACA CACTCCCCAC CCCAAACTCA AGCACCTCCT 2010 
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GAATTTTTAA TCAGCAATAT TCCCTCTCCT TCICTTCTTT ATTGCACTAC GAATTCTTTC 2070 

CATTCCTTAC TTOCACTCTT ACTCTTAATT 7TAAACACCC AACTTCCCAA AATCTTCCCT 2130 

CCGTACTCCA CTCCTCTCTC TTTCCATAAT ACCAATTCAA TTTGGCAAAA CAAAATCTAA ^ 2190 

TCTCACACTT TGCTCCATTT TACACATCTC CTGATOTTTA CAATCATCCC GAACATTAGG 2250 

AATTGTTTAT ACACAACTTT CCAAATTATT TATTACTTCT CCACTTAGTA CTTTTTACAA 2310 

AACTGCTTTC TGCATATGTT AAACCTTATT TTTATCTCCT CTTATCATTT TATTACAOAA 2370 

ATGTTTTTAA CACTATACTC TAAAATCCAC ATTTTCTTTT ATTATCAGTT AAAATCACAT 2430 

TTTAAGTGCT TCACATTTGT ATGTGTGTAG ACTGTAACTT TTTTTCAOTT CATATGCACA 2490 

ACGTATTTAG CCATTACCCA CGTGACACCA CCGAAXATAI TATCGATTTA GAAGCAAAGA 2S50 

TTTCAGTAGA ATTTTAGTCC TGAACGCTAC GGGGAAAATG CATTTTCTTC AGAATTATCC 2610 

ATTACCTGCA TTTAAACTCT GCCAGAAAAA AATAACTATT TTGTTTTAAT CTACTTTTTG 2670 
TATTTAGTAG TTATTTGTAT AAATTAAATA AACTGTTTTC AAGTCAAAAA AAAA 

(2) INFORMATION FOR SEQ ID NOl 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Val Asp Cly Val Met He Leu Pro Val Leu He Met He Ala Leu 

Pro ser Pro ser Met Glu Aop Glu Ly. Pro Lye Val Asn Pro Lys Leu 

20 25 30 

Tyr Met Cys Val Cye Glu Gly Leu Ser Cy« Gly Aen Glu Asp His Cye 

* 35 40 45 

Glu Gly Gin Gin Cye Phe Ser Ser Leu Ser He Aen Aap Gly Phe Hie 

50 55 60 

val Tyr Gin Lys Gly Cys Phe Gin Val Tyr Glu Gin Gly Lys Met Thr 
65 70 75 

Cys Lys Thr Pro Pro Ser Pro Gly Gin Ala Val Clu Cys Cys Gin Gly 

* * 95 90 9* 

Asp Trp Cys Asn Arg Asn He Thr Ala Gin Leu Pro Thr Lys Gly Lys 

Ser Phe Pro Cly Thr Gin Asn Phe His Leu Glu Val Gly Leu He He 
H5 120 125 
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X..U Ser Val V.l Ph. Ala Val Cy. L.U L.U Al. Cy. L.U L.U Oly VI 

130 135 
Ala Leu Arg Ly. Ph. Ly. Arg Arg A.n Cln Gi« Ar, L.u A.n Pro Arg 

145 150 

A.p VI Cl« Tyr Oly Thr XI. Clu Cly Leu 11. Thr Thr A.n Val Cly 

165 

A.p s.r Thr Leu Ala A-p Leu Leu A-P Hi. S.r Cy. Thr ser Cly S.r 



180 



Cly ser Cly Lau Pro Ph. Leu Val Cln Arg Thr Val Ala Arg Cln XI. 



195 



Thr Leu Leu Clu Cy. Val Cly Ly. Cly Arg Tyr Cly Clu Val Trp Ar, 

210 215 " 

Cly ser Trp Gla Cly Clu A.n Val Ala Val Ly. II. Ph. S.r Ser Arg 
225 230 

A. P Clu Ly. S.r Trp Ph. Arg Clu Thr Clu L.u Tyr A.n Thr Val Met 

Leu Arg Hi. Clu A.n II. L.u Cly Ph. II. Ala S.r A.p M.t Thr S.r 

260 265 
Arg Hi. ser S.r Thr Gin Leu Trp Leu II. Thr Hi. Tyr Hi. Clu M.t 



Oly Ser Leu Tyr A.p Tyr Leu Gin Leu Thr Thr Leu A.p Thr Val S.r 
290 295 

Cy. Leu Arg II. Val Leu Ser II. Ala S.r Gly Leu Ala Hi. Leu Hi. 
305 310 **» 

II. Clu II. Ph. Cly Thr Cln Cly Ly. Pro Ala II. Ala Hi. Arg A.p 

325 330 

Leu Ly. Ser Ly. A.n lie Leu Val Ly. Ly. A.n Cly Gin Cy. Cy. II. 



340 



Ala Asp Leu Cly Leu Ala Val Met Hi. S.r Cln Ser Thr A.n Cln Leu 



355 



370 ' 375 
Clu val Leu A.p Clu Thr lie Cln Val A.p Cy. Phe A.p Ser Tyr Ly. 



A.p Val Gly A.n A.n Pro Arg Val Cly Thr Ly. Arg Tyr Mat Ala Pro 

cy. 

385 390 395 

Arg Val Asp II. Trp Ala Ph. Cly Leu Val Leu Trp Clu Val Ala Arg 
405 4iu 

Arg Met Val Ser A.n Cly lie Val Clu A.p Tyr Ly. Pro Pro Phe Tyr 
420 4" 

A.p Val Val Pro A.n A.p Pro S.r Ph. Clu A.p M.t Arg Ly. Val Val 
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Cvi Val Asp Cln Gin Arg Pro Asn He Pro Aen Arg Trp Ph# Ser Asp 

1 450 455 460 

Pro Thr Leu Thr Ser L*u Ala Lye Leu Met Lys Clu Cys Trp Tyr Gin 

465 470 475 480 

Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg He Lye Lye Thr Leu Thr 

485 490 , 495 

Lye He Asp Asn Ser Leu Aep Lye Leu Lys Thr Asp Cye 

500 505 



(2) INFORMATION FOR SEQ ID NO: 5: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 2932 base pairs 

(B) TYPEt nucleic acid 

(C) STRAND ED NESS s unknown 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL! NO 

(iii) ANTI-SENSE i NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 310. .1905 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCTCCGCCCC GAGGGCTGGA GGATGCGTTC CCTGGGGTCC GGACTTATGA AAATATGCAT 60 

CACTTTAATA CTGTCTTGGA ATTCATGAGA TGCAAGCATA GGTCAAAGCT CTTTGGAGAA 120 

AATCAGAACT ACAGTTTTAT CTAGCCACAT CTTGGAGGAG TCGTAAGAAA GCAGTGGGAG 180 

TTGAAGTCAT TGTCAAGTCC TTCCCATCTT TTACAAGAAA ATCTCACTGA ATGATAGTCA 240 

TTTAAATTGG TGAAGTAGCA AGACCAATTA TTAAACCTCA CACTACACAC GAAACATTAC 300 

AATTCAACA ATG ACT CAG CTA TAC ATT TAC ATC AGA TTA TTC GCA GCC 348 
Met Thr Gin Leu Tyr He Tyr He Arg Leu Leu Gly Ala 
15 10 

TAT TTC TTC ATC ATT TCT CCT GTT CAA GCA CAG AAT CTG GAT ACT ATG 396 
Tyr Leu Phe He He Ser Arg Val Cln Gly Cln Asn Leu Asp Ser Met 
15 20 25 
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CTT CAT CCC ACT GGG ATC AAA TCA CAC TCC CAC CAO AAA AAC TCA CAA 
St* cly Thr Cly Met Ly. Ser Asp Ser Asp Gin Lys Lys S.r Clu 
30 35 40 

AAT CCA CTA ACC TTA CCA CCA CAC CAT ACC TTC CCT TTT TTA AAO TCC 
xln Cly val Thr L.u Ala Pro Clu Asp Thr Leu Pro Ph. Leu Lys Cys 
50 55 «0 

TAT TGC TCA CGG CAC TCT CCA CAT GAT OCT ATT AAT AAC ACA TGC ATA 
Tyr Cys ser Gly His Cys Pro Asp Asp Ala II. Asn Asn Thr Cys II. 

ACT AAT CCA CAT TCC TTT CCC ATC ATA CAA CAA CAT CAC CAC CCA CAA 

ThJ JSn Cly His Cys Ph. Ala II. II. Glu Clu Asp Asp Gin Gly Clu 

80 85 90 

ACC ACA TTA GCT TCA CCC TGT ATC AAA TAT CAA GGA TCI CAT TTT CAC 
Thr Thr Leu Ala S.r Cly Cys Mat Lys Tyr Clu Cly Ssr Asp Ph. Gin 
95 100 10S 

TCC AAA CAT TCT CCA AAA CCC CAC CTA CCC CCC ACA ATA CAA TCT TGT 
cys Lys Asp ssr Pro Lys Ala Cln Leu Arg Arg Thr II. Clu Cys Cys 
210 115 120 «5 

CCC ACC AAT TTA TCT AAC CAC TAT TTC CAA CCC ACA CTC CCC CCT GTT 
Arg Thr Asn Leu Cys Asn Cln Tyr Leu Cln Pro Thr Leu Pro Pro Val 
130 135 1*0 

CTC ATA CCT CCC TTT TTT GAT CCC ACC ATT CCA TCC CTC CTT TTC CTC 
Val He Gly Pro Phe Phe Asp Gly Ser He Arg Trp Leu Val Leu Leu 
145 ISO 1S5 

ATT TCT ATC CCT CTC TCC ATA ATT GCT ATC ATC ATC TTC TCC ACC TGC 
11. Ser Met Ala Val Cys He He Ala Met He II. Ph. Ser S.r Cys 
160 165 170 

TTT TCT TAC AAA CAT TAT TGC AAC ACC ATC TCA ACC ACA CCT CCT TAC 
Phe cys Tyr Lys His Tyr Cys Lys Ser He Ser Ser Arg Arg Arg Tyr 
175 180 185 

AAT CCT GAT TTG GAA CAC GAT GAA CCA TTT ATT CCA GTT GGA GAA TCA 
Asn Arg Asp Leu Clu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser 
190 195 200 205 

CTA AAA CAC CTT ATT GAC CAG TCA CAA ACT TCT CCT ACT CCC TCT GGA 
Leu Lvs Asp Leu He Asp Gin Ser Gin Ser Ser Gly Ser Cly Ser Cly 
* 210 215 220 

CTA CCT TTA TTG CTT CAG CCA ACT ATT CCC AAA CAG ATT CAG ATC GTC 
Leu Pro Leu Leu Val Cln Arg Thr He Ala Lys Gin He Gin Met Val 
225 230 235 

CCC CAA CTT CCT AAA CCC CCA TAT CCA CAA CTA TCC ATC CCC AAA TGC 

Arg Gin Val Cly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp 
240 245 250 

CCT CCC CAA AAA CTC GCC GTC AAA CTA TTC TTT ACC ACT CAA CAA GCC 
Arg Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Clu Clu Ala 
255 260 265 



444 



492 



540 



S88 



636 



684 



732 



780 



828 



876 



924 



972 



1020 



1068 
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AGC TGG TTT CGA CAA ACA GAA ATC TAC CAA ACT GTC CTA ATC CGC CAT 1164 
Ser Trp Phc Arg Clu Thr Glu XI* Tyr Gin Thr Val Leu Met Arg Bi« 
270 275 280 285 

CAA AAC ATA CTT GOT TTC ATA GCC CCA GAC ATT AAA OCT ACA OCT TCC 1212 
Clu Aon He Leu Cly Phe He Ala Ala Aep He Lye Cly Thr Gly Ser 
290 295 300 

TCC ACT CAC CTC TAT TTC ATT ACT CAT TAC CAT CAA AAT CCA TCT CTC 1260 
Trp Thr Cln Leu Tyr Leu lie Thr Aep Tyr Hie Clu Aen Cly Ser Leu 
305 310 315 

TAT CAC TTC CTC AAA TCT CCT ACA CTC CAC ACC ACA CCC CTC CTT AAA 1308 
Tyr Aep Phe Leu Lye Cye Ale Thr Leu Aep Thr Arg Ale Leu Leu Lye 
320 325 330 

TTC CCT TAT TCA CCT CCC TGT GCT CTC TCC CAC CTC CAC ACA GAA ATT 1356 
Leu Ala Tyr Ser Ale Ale Cye Cly Leu Cye Hie Leu Hie Thr Clu He 
335 340 345 

TAT CCC ACC CAA CCA AAC CCC CCA ATT CCT CAT CCA CAC CTA AAC ACC 1404 
Tyr Cly Thr Cln Cly Lye Pro Ale He Ale Hie Arg Aep Leu Lye Ser 
3S0 355 360 365 

AAA AAC ATC CTC ATC AAC AAA AAT CCC ACT TCC TCC ATT CCT CAC CTC 1452 
Lye Aen He Leu He Lye Lye Aen Cly Ser Cye Cye He Ala Aep Leu 
370 375 380 

GCC CTT GCT CTT AAA TTC AAC AGT CAC ACA AAT CAA CTT CAT CTC CCC 1500 
Cly Leu Ala Val Lye Phe Aen Ser Aep Thr Aen Clu Val Aep Val Pro 
385 390 395 

TTC AAT ACC AGC GTC GGC ACC AAA CCC TAC ATC CCT CCC GAA CTC CTC 1548 
Leu Aen Thr Arg Val Cly Thr Lye Arg Tyr Met Ala Pro Clu Val Leu 
400 405 410 

GAC CAA AGC CTC AAC AAA AAC CAC TTC CAC CCC TAC ATC ATC CCT CAC 1596 
Aep Glu Ser Leu Aen Lye Aen Hie Phe Cln Pro Tyr He Met Ala Asp 
415 420 425 

ATC TAC AGC TTC CGC CTA ATC ATT TGG GAC ATC CCT CCT CCT TCT ATC 1644 
He Tyr Ser Phe Gly Leu He He Trp Clu Met Ala Arg Arg Cye He 
430 435 440 445 

ACA CGA GGG ATC CTG CAA CAA TAC CAA TTC CCA TAT TAC AAC ATC CTA 1692 
Thr Gly Gly He Val Glu Clu Tyr Cln Leu Pro Tyr Tyr Aen Met Val 
450 455 460 

CCG AGT CAT CCG TCA TAC GAA GAT ATC CCT GAG GTT CTG TGT CTC AAA 1740 
Pro Ser Asp Pro Ser Tyr Clu Aep Met Arg Clu Val Val Cye Val Lye 
465 470 475 

CCT TTG CGC CCA ATT CTG TCT AAT CGC TGG AAC ACT CAT CAA TCT CTA 1788 
Arg Leu Arg Pro He Val Ser Aen Arg Trp Aen Ser Aep Clu Cye Leu 
480 485 490 



CCA CCA GTT TTG AAG CTA ATC TCA CAA TGC TCC CCC CAC AAT CCA CCC 
Arg Ala Val Leu Lye Leu Met Ser Clu Cye Trp Ala Hie Aen Pro Ala 
495 500 505 
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TCC AGA CTC ACA CCA TTO AGA ATT AAO AAO ACO CTT OCC AAO ATO OCT 1884 
sir 5£g Su Thr Ala Leu Arg Ue Ly. ly. Thr lau Ala Ly. Met Val 

510 5.15 520 525 

CAA TCC CAA CAT CTA AAA ATC TCATCCTTAA ACCATCCCAC CACAAACTCT 1935 
Glu Ser Gin Asp Val Lys XI* 
530 

ACACTCCAAG AACTCTTTTT ACCCATCCCA TCCCTCCAAT TAGAGTGCAA TAACCATCTT 1995 

AACTTGGTTC TCACACTCTT TCTTCACTAC CTCTTCACAC CCTCCTAATA TTAAACCTTT 20SS 

CAGTACTCTT ATTAGGATAC AAGCTGGGAA CTTCTAAACA CTTCATTCTT TATATATGGA 2115 

CAGCTTTATT TTAAATCTCC TTTTTGATGC CTTTTTTTAA CTGCGTTTTT ATGAACTGCA 2175 

TCAAGACTTC AATCCTGATT AGTGTCTCCA GTCAAGCTCT CGGTACTGAA TTGCCTGTTC 2235 

ATAAAACGGT GCTTTCTGTC AAAOCCTTAA GAAGATAAAT CAGCCCAGCA GAGATGGAGA 2295 

AATAGACTTT GCCTTTTACC TGAGACATTC AGTTCCTTTC TATTCTACCT TTGTAAAACA 2355 

GCCTATAGAT GATGATGTGT TTCGCATACT GCTTATTTTA TGATACTTTG TCCTGTGTCC 2415 

TTAGTCATGT GTGTGTGTCT CCATGCACAT GCACGCCCGG ATTCCTCTGC TGCCATTTGA 2475 

ATTAGAAGAA AATAATTTAT ATCCATCCAC AGGAAGATAT TGGTGGCCGC TCGTTTTGTG 2535 

CTTTAAAAAT GCAATATCTG ACCAAGATTC GCCAATCTCA TACAAGCCAT TTACTTTGCA 2595 

AGTGACATAG CTTCCCCACC AGCTTTATTT TTTAACATGA AAGCTGATGC CAAGGCCAAA 2655 

AGAAGTTTAA AGCATCTGTA AATTTGGACT GTTTTCCTTC AACCACCATT TTTTTTGTGG 2715 

TTATTATTTT TCTCACGGAA AGCATCCTCT CCAAAGTTGG AGCTTCTATT GCCATGAACC 2775 

ATGCTTACAA AGAAAGCACT TCTTATTGAA GTGAATTCCT GCATTTGATA GCAATCTAAG 2835 

TGCCTATAAC CATGTTCTAT ATTCTTTATT CTCAGTAACT TTTAAAAGGG AAGTTATTTA 2895 

TATTTTCTGT ATAATGTGCT TTATTTGCAA ATCACCC 2932 

(2) INFORMATION TOR SEQ ID NOt 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 532 amino acid* 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protain 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Thr Gin Leu Tyr He Tyr He Arg Leu Leu Cly Ala Tyr Leu Phe 
1 5 10 15 

He He Ser Arg Val Gin Gly Gin Asn Leu Aap Ser Met Leu Bis Cly 

20 25 30 



WO 94/11502 PCT/GB93/02367 

49 

Thr Cly Mtit Lys Ssr Asp Ser Asp Gin Lys Lys Ser Glu Asn Gly Val 
35 40 45 

Thr Leu Ala Pro Glu Asp Thr Lsu Pro Phs Leu Lys Cys Tyr Cys Ser 

50 55 60 

Gly His Cys Pro Asp Asp Ala lis Asn Asn Thr Cys lis Thr Asn Gly 

65 70 75 80 

His Cys Phs Ala lis lis Glu Glu Asp Asp Gin Gly Glu Thr Thr Lou 
85 90 95 

Ala Ser Gly Cys Met Lys Tyr Glu Gly Ser Asp Phs Gin Cys Lys Asp 

100 105 110 

Ser Pro Lys Ala Gin Leu Arg Arg Thr lis Glu Cys Cys Arg Thr Asn 
115 120 125 

Leu Cys Asn Gin Tyr Lsu Gin Pro Thr Lsu Pro Pro Val Val lis Gly 
130 135 140 

Pro Phe Phe Asp Gly Ser He Arg Trp Leu Val Leu Leu He Ser Met 
145 150 155 160 

Ala Val Cys lis He Ala Met He He Phe Ser Ser Cys Phe Cys Tyr 

165 170 175 

Lys His Tyr Cys Lys Ser He Ser Ser Arg Arg Arg Tyr Asn Arg Asp 

180 185 190 * v 

Leu Glu Gin Asp Glu Ala Phe He Pro Val Gly Glu Ser Leu Lys Asp 
195 200 205 

Leu He Asp Gin Ser Gin Ser Ser Gly Ser Gly Ser Cly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr He Ala Lys Gin He Gin Met Val Arg Gin Val 
225 230 235 240 

Gly Lys Gly Arg Tyr Gly Glu Val Trp Met Cly Lys Trp Arg Gly Glu 
245 250 255 

Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe 
260 265 270 

Arg Glu Thr Glu He Tyr Gin Thr Val Leu Met Arg His Glu Asn He 
275 280 285 

Leu Gly Phe He Ala Ala Asp He Lys Cly Thr Cly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu He Thr Asp Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lys Cys Ala Thr Leu Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr 
325 330 335 

Ser Ala Ala Cys Gly Leu Cys His Leu His Thr Glu He Tyr Gly Thr 
340 345 350 
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Gin Gly Lys Pro Ala lie Ala Hie Arg Asp Leu Lya Ser Lys Am He 

355 360 365 

Leu lie Lyi Lya Asn Gly Ser Cys Cya lie Ala Asp Leu Gly Leu Ala 

370 375 380 

Val Lye Pha Aan Ser Asp Thr Aan Glu Val Asp Val Pro Lau Asn Thr 
385 390 395 400 

Ara Val Gly Thr Lys Arg Tyr Met Ala Pro Glu Val Lau Asp Glu Sar 
405 410 415 

Lau Asn Lys Aan Hia Pha Gin Pro Tyr Ila Mat Ala Asp Ila Tyr Sar 
420 425 430 

Pha Gly Leu Ila Ila Trp Glu Met Ala Arg Arg fcys Ila Thr Gly Gly 
435 440 445 

Ila Val Glu Glu Tyr Gin Lau Pro Tyr Tyr Asn Mat Val Pro Sar Aap 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cya Val Lys Arg Lau Arg 
465 470 475 480 

Pro He Val Sar Asn Arg Trp Aan Ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lys Leu Mat Sar Glu Cys Trp Ala His Asn Pro Ala Sar Arg Leu 

500 505 510 

Thr Ala Leu Arg Ila Lys Lys Thr Leu Ala Lya Mat Val Glu Ser Gin 
515 520 525 

Aap Val Lys He 

530 

<2) INFORMATION FOR SEQ ID NO J 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1515 
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(xi) SBQOSNCK DESCRIPTION t SBQ ZD HO I 7s 

ATG CCG CAC TCC GCC CCA CCC TCC TCC TTC TTC CCC CTT CTT CTC CTC 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
15 10 15 

CTC CTC CCC GCC AGC CCC CCC TCC CCC CCC CCG GGG GTC CAC GCT CTC 96 
Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gin Ala Leu 
20 25 30 

CTC TGT GCC TCC ACC ACC TCC CTC CAC GCC AAC TAC ACG TGT GAG ACA 144 
Leu Cye Ala Cys Thr Ser Cya Leu Gin Ala Aan Tyr Thr Cya Glu Thr 
35 40 45 

CAT GGG GCC TCC ATC CTT TCC TTT TTC AAT CTC GAT CCG ATC GAG CAC 192 
Asp Gly Ala Cya Met Val Ser Phe Phe Aan Leu Aap Gly Met Glu Hia 
50 55 60 

CAT GTC CGC ACC TCC ATC CCC AAA CTG GAG CTC GTC CCT GCC GGG AAG 240 
Hie Val Arg Thr Cya lie Pro Lye Val Glu Leu Val Pro Ala Gly Lya 
65 70 75 80 

CCC TTC TAC TCC CTC ACC TCG GAG GAC CTG CCC AAC ACC CAC TCC TCC 288 
Pro Phe Tyr Cya Leu Ser Ser Glu Aap Leu Arg Aan Thr Hia Cya cya 
85 90 95 

TAC ACT GAC TAC TCC AAC ACC ATC CAC TTC ACC GTC CCC ACT CCT CAC 336 
Tyr Thr Aap Tyr Cya Aan Arg lie Aap Leu Arg Val Pro Ser Gly Hia 
100 105 110 

CTC AAG GAG CCT GAC CAC CCG TCC ATC TCG GCC CCG GTC GAG CTC GTA 384 
Leu Lys Glu Pro Glu Hia Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

GGC ATC ATC CCC CCC CCC CTC TTC CTC CTC TTC CTC ATC ATC ATC ATT 432 
Gly lie lie Ala Gly Pro Val Phe Leu Leu Phe Leu lie lie He He 
130 135 140 

CTT TTC CTT CTC ATT AAC TAT CAT CAC CCT CTC TAT CAC AAC CGC CAG 480 
Val Phe Leu Val He Aan Tyr Hia Gin Arg Val Tyr Hia Aan Arg Gin 
145 150 155 160 

AG A CTC CAC ATC CAA CAT CCC TCA TGT GAC ATC TGT CTC TCC AAA GAC 528 
Arg Leu Aap Met Glu Aap Pro Ser Cya Glu Met Cya Leu Ser Lya Aap 
165 170 175 

AAC ACC CTC CAC CAT CTT CTC TAC CAT CTC TCC ACC TCA GCC TCT CGC 576 
Lya Thr Leu Gin Aap Leu Val Tyr Aap Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

TCA GCC TTA CCC CTC TTT CTC CAC CCC ACA CTG GCC CCA ACC ATC GTT 624 
Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr He Val 
195 200 205 

TTA CAA GAG ATT ATT GCC AAC GCT CCC TTT CCG GAA GTA TCG CCG CGC 672 
Leu Gin Glu He He Gly Lya Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 
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eee TGG AGO COT COT CAT CTO OCT CTO AAA ATA TTC TCT TCT CCT CAA 720 

5? £S ciy a-p v * 1 Al * val **■ l\l ph- s-r s * r **' fiS 

225 230 235 

CAA CGG TCT TGC TTC AGO GAA CCA GAO ATA TAC CAG ACG GTC ATG CTG 768 
Glil Arg sS Trp Ph« Arg Glu Ala Olu II. Tyr Gin Thr V.l Met Leu 
245 250 * ss 

CGC CAT CAA AAC ATC CTT CGA TTT ATT CCT GCT GAC AAT AAA GAT AAT 816 
Si. Glu A.n XI. Leu Cly Ph. U. Ala Ala A.p A.n Ly. A.p A.n 

' 260 265 «70 

GGC ACC TGG ACA CAG CTG TGG CTT GTT TCT GAC TAT CAT GAG CAC GGG 864 
Cly Thr Trp Thr Gin L.u Trp L.u V4l S.r A.p Tyr Hi. Glu Bis Cly 
275 280 285 

TCC CTG TTT GAT TAT CTG AAC GGG TAC ACA GTC ACA ATT GAG CGC ATC 912 
sS SS SS XiJ ?yr L.u A.n Arg Tyr Thr Val Thr II. Glu Gly Met 

295 300 



290 



960 



1008 



ATT AAG CTG CCC TTC TCT GCT GCT ACT GGG CTO GCA CAC CTG CAC ATO 
II. Ly. Leu All Leu S.r Ala Ala Ser Cly L.u Ala Hit Leu Bli Met 
305 310 315 320 

GAG ATC CTG GGC ACC CAA GGO AAG CCT GCA ATT CCT CAT CGA GAC TTA 
Glu XI. Val Gly Thr Gin Gly Ly. Pro Gly II. Ala Hi. Arg A«p L.u 

325 330 335 

AAG TCA AAG AAC ATT CTG GTG AAG AAA AAT GGC ATG TCT GCC ATA GCA 1056 
Ly« ser Lya A.n II. Leu Val Lya Ly. A.n Gly Met Cy. Ala II. Ala 
7 340 345 350 

GAC CTG GGC CTG CCT CTC CCT CAT CAT CCA GTC ACT CAC ACC ATT CAC 1104 
A.p Leu Gly Leu Ala Val Arg Hi. A.p Ala Val Thr A.p Thr II. A.p 
355 360 365 

ATT GCC CCG AAT CAG AGO GTG GGG ACC AAA CCA TAC ATO GCC CCT GAA 1152 
II. Ala Pro A.n Gin Arg Val Gly Thr Ly. Arg Tyr M.t Ala Pro Glu 
370 375 380 

GTA CTT GAT GAA ACC ATT AAT ATG AAA CAC TTT GAC TCC TTT AAA TCT 1200 
Val Leu A.p Glu Thr lie A.n Met Ly. Hi. Ph. Asp Ser Phe Ly. Cy. 
385 390 395 400 

GCT GAT ATT TAT GCC CTC GGG CTT GTA TAT TGG GAG ATT GCT CGA AGA 1248 
Ala Asp II. Tyr Ala Leu Gly Leu Val Tyr Trp Glu II. Ala Arg Arg 
405 410 415 

TGC AAT TCT GCA GCA CTC CAT CAA GAA TAT CAG CTG CCA TAT TAC CAC 1296 

cv. A.n ser Gly Gly Val Hi. Glu Glu Tyr Gin L*u Pro Tyr Tyr Asp 
' 420 425 430 

TTA GTG CCC TCT CAC CCT TCC ATT GAG GAA ATG CGA AAG GTT GTA TCT 1344 
Leu Val Pro Ser Aap Pro Ser II. Glu Clu Met Arg Ly. Val Val Cy. 
435 440 445 

G>T CAG AAG CTC CCT CCC AAC ATC CCC AAC TGG TGC CAC ACT TAT CAG 1392 
A.p Gin Ly. L.u Arg Pro A.n II. Pro A.n Trp Trp Gin S.r Tyr Glu 
450 455 460 
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CCA CTC CCC CTC ATO CCC AA6 ATC ATG OCA GAC TCT TCG TAT CCC AAC 1440 
Ala Leu Arg V*l Met Cly Lye Met Met Arg Glu Cye Trp Tyr Ala Aan 
465 470 475 480 

GCC CCA CCC CCC CTC ACQ CCC CTC CCC ATC AAC AAC ACC CTC TCC CAC 1488 
Cly Ala Ala Arg Leu Thr Ala Leu Arg lie Lye Lye Thr Leu Ser Gin 
485 490 495 

CTC ACC CTC CAC CAA GAC CTC AAG ATC TAACTGCTCC CTCTCTCCAC 1535 
Leu Ser Val Gin Glu Aep Val Lye Zle 
SOO SOS 



ACGGAGCTCC 


TGGCAGCGAG 


AACTAwGGAw 


& ^* m^w^^ mmm m^ 


T***^^ % MM^M% M 


GATGGAGGCC 


1 coc 


TACCTCTCGT 


pjmumwp^ mmm« f* 

TTCTGCCCAG 


CCCTCTGTWi 




enM_MMMMMM« % 

TvrtrCCCGGAA 


M» MMMm M»M» 

wAviKvACAwA 




GCCCGGGAGA 


GACTCGCTGA 


CTCCCATGTT 


GGGTTTGAGA 


GAGAGACCTT 


TTCTATTTAC 


± / 13 


CTCCTAATGG 


CATGGAGACT 




ATTGTGTGGA 


GAAwTCAGTG 


CCACAwV*TU# 


A / * 3 


AACTGGTTGT 


% Mdv»MMm n mm 
AGTGGGAAGT 


CCCGCGAAAC 


CCGGTGCATC 


IHMMM.% MM ejfcMM 

TGGCACGTGG 


MM fk MM « M_ MM% 

CGAGGAGCGA 


ICS -3d 


TGACAGGGGC 


GCTTGGGAGG 


GGCCGGAGGA 


ACCCAGGTGT 


TGCCAGTGCT 


AAGCTGCCCT 


1895 


GAGGGTTTCC 


TTCGGGGACC 


AGCCCACAGC 


ACACCAAGGT 


GGCCCGGAAG 


AACCAGAAGT 


1955 


CCAGCCCCTC 


TCACAGGCAG 


CTCTGAGCCG 


CG CTTTCCCC 


TCCTCCCTCG 


GATGGACGCT 


2015 


GCCGGGAGAC 


TGCCAGTGGA 


GACGGAATCT 


GCCGCTTTGT 


CTGTCCAGCC 


GTGTGTGCAT 


2075 


GTGCCGAGGT 


GCGTCCCCCC 


TTGTGCCTGG 


TTCGTCCCAT 


CCCCTTACAC 


GTGCGTGTGA 


2135 


GTCTGTGTGT 


GTGTCTGTAG 


CTCCGCACTT 


ACCTGCTTGA 


GCTTTCTGTC 


CATGTGCAGG 


2195 


TCGGGGGTGT 


GGTCGTCATG 


CTGTCCG7GC 


TTCCTGGTGC 


CTCTTT7CAG 


TAGTGAGCAG 


22SS 


CATCTAGTTT 


CCCTGGTGCC 


CTTCCCTGGA 


GGTCTCTCCC 


TCCCCCAGAG 


CCCCTGATGC 


2315 


CACAGTGGTA 


CTCTGTGT 










2333 



(2) INFORMATION FOR SEQ ID NOs B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu Val Val Leu 
15 10 15 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly Val Gin Ala Leu 
20 25 30 
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Leu Cys XI* Cys Thr Ser Cys Leu Oln Ala Asn Tyr Thr Cys Clu Thr 
35 40 45 

Asp Cly Aim Cy« Met Val Ser Phe Phe Asn Leu Asp Cly Met Clu Bis 

50 55 60 

His Val Arg Thr Cys He Pro Lye Val Clu Leu Val Pro Ala Cly Lys 

65 70 75 80 

Pro Phe Tyr Cys Leu Ser Ser Clu Asp Leu Arg Asn Thr His ^Cys Cys 
85 90 95 

Tyr Thr Asp Tyr Cys Asn Arg He Asp Leu Arg Val Pro Ser Cly His 

100 105 110 

Leu Lys Clu Pro Clu His Pro Ser Met Trp Cly Pro Val Clu Leu Val 
115 120 125 

Cly He He Ala Cly Pro Val Phe Leu Leu Phe Leu He He He He 

130 135 140 

Val Phe Leu Val He Asn Tyr His Gin Arg Val Tyr His Asn Arg Cln 
145 150 155 160 

Arg Leu Asp Met Clu Asp Pro Ser Cys Clu Met Cys Leu Ser Lys Asp 
165 170 175 

Lys Thr Leu Cln Asp Leu Val Tyr Asp Leu Ser Thr Ser Cly Ser Cly 
180 185 190 

Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr He Val 
195 200 205 

Leu Cln Clu He He Cly Lys Cly Arg Phe Cly Clu Val Trp Arg Gly 
210 215 220 

Arg Trp Arg Gly Gly Asp Val Ala Val Lys He Phe Ser Ser Arg Glu 
225 230 235 240 

Glu Arg Ser Trp Phe Arg Clu Ala Clu He Tyr Cln Thr Val Met Leu 
245 250 255 

Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Asp Asn 
260 265 270 

Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His Glu His Cly 
275 280 285 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr He Glu Gly Met 
290 295 300 

He Lys Leu Ala Leu Ser Ala Ala Ser Cly Leu Ala His Leu His Mst 

305 310 315 320 

Clu He Val Gly Thr Cln Gly Lys Pro Cly He Ala His Arg Asp Leu 
325 330 335 

Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Met Cys Ala He Ala 
340 345 350 
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Asp Leu Cly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr lis Asp 

355 360 365 

lis Ala Pro Asn Gin Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Glu 

370 375 380 

Val Leu Asp Glu Thr He Asn Met Lys His Phe Asp Ser Phe Lys Cys 

385 390 395 400 

Ala Asp He Tyr Ala Leu Gly Leu Val Tyr Trp Glu He Ala Arg Arg 
405 410 415 

Cys Aan Ser Gly Gly Val His Glu Glu Tyr Gin Leu Pro Tyr Tyr Asp 
420 425 430 

Leu Val Pro Ser Asp Pro Ser He Glu Glu Met Arg Lys Val Val Cys 
435 440 445 

Asp Gin Lys Leu Arg Pro Asn He Pro Asn Trp Trp Gin Ser Tyr Glu 
450 455 460 

Ala Leu Arg Val Met Gly Lys Met Met Arg Glu Cys Trp Tyr Ala Asn 
465 470 475 480 

Gly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Ser Gin 
485 490 495 

Leu Ser Val Gin Glu Asp Val Lys He 
500 505 



(2) INFORMATION FOR SEQ ID NO: 9s 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(lx) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 77 ..1585 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCGAGGCGA GCTTTCCTGG GGTGAGGCAG CGSCGCGGCC GGGCCGGGCC GGGCCACAGG 60 
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60 



109 



157 



205 



CCGTCCCCCC GGGACC ATC GAG COC CCC CTC OCT CCT CCC OCT CCC COO 

Nat Clu Ala Ala Val Ala Ala Pro Arg Pro Arg 
1 5 10 

CTC CTC CTC CTC CTC CTC CCC CCC CCC CCC CCC CCC CCC CCC CCC CTC 
Leu Leu Leu Leu Val Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu 
15 20 25 

CTC CCC CCC CCC ACC CCC TTA CAC TCT TTC TCC CAC CTC TCT ACA AAA 
Lau Pro Cly Ala Thr Ala Leu Cln Cya Pha Cya Hia Lau Cye Thr Lya 
30 3S 40 

CAC AAT TTT ACT TCT CTC ACA CAT CCC CTC TCC TTT CTC TCT CTC ACA 253 
Aap Aon Pha Thr Cya Val Thr Aep Cly Leu Cya Phe Val Ser Val Thr 
45 50 55 

CAC ACC ACA CAC AAA CTT ATA CAC AAC ACC ATC TCT ATA CCT CAA ATT 301 
Clu Thr Thr Aap Lya Val He Hia Aan Ser Met Cya He Ala Clu lie 

*e 70 75 



CAC TTA ATT CCT CCA CAT ACC CCC TTT CTA TCT CCA CCC TCT TCA AAA 349 
Aap Leu He Pro Arg Aap Arg Pro Phe Val Cya Ala Pro Ser Ser Lye 
80 85 90 

ACT CCC TCT CTC ACT ACA ACA TAT TCC TCC AAT CAC CAC CAT TCC AAT 397 
Thr Cly Ser Val Thr Thr Thr Tyr Cya Cya Aan Cln Aap Hia Cye Aan 
95 100 105 

AAA ATA CAA CTT CCA ACT ACT CTA AAC TCA TCA CCT CCC CTT CCT CCT 445 
Lya He Clu Leu Pro Thr Thr Val Lya Ser Ser Pro Cly Leu Cly Pro 
110 115 120 

CTC CAA CTC CCA CCT CTC ATT CCT CCA CCA CTC TCC TTC CTC TCC ATC 493 
Val Clu Leu Ala Ala Val He Ala Cly Pro Val Cya Phe Val Cya He 
125 130 135 

TCA CTC ATC TTC ATC CTC TAT ATC TCC CAC AAC CCC ACT CTC ATT CAC 541 
Ser Leu net Leu Met Val Tyr He Cya Hia Aan Arg Thr Val He Hia 
140 145 150 155 

CAT CCA CTC CCA AAT CAA CAC CAC CCT TCA TTA CAT CCC CCT TTT ATT 589 
Hia Arg Val Pro Aan Clu Clu Aap Pro Ser Leu Aap Arg Pro Phe He 
160 165 170 

TCA CAC CCT ACT ACC TTC AAA CAC TTA ATT TAT CAT ATC ACA ACC TCA 637 
Ser Clu Cly Thr Thr Leu Lya Aap Leu He Tyr Aap Met Thr Thr Ser 
175 180 185 

CCT TCT CCC TCA CCT TTA CCA TTC CTT CTT CAC ACA ACA ATT CCC ACA 685 
Cly Ser Cly Ser Cly Leu Pro Leu Leu Val Gin Arg Thr He Ala Arg 
190 195 200 

ACT ATT CTC TTA CAA CAA ACC ATT CGC AAA CCT CCA TTT GGA CAA CTT 733 

Thr He val Leu Cln Clu Ser He Cly Lye Cly Arg Phe Cly Clu Val 
205 210 215 

TCC ACA CCA AAC TCC CCC GGA CAA CAA CTT CCT CTT AAC ATA TTC TCC 781 
Trp Arg Cly Lya Trp Arg Cly Clu Clu Val Ala Val Lya Ha Phe Ser 
220 225 230 235 
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TCT AGA CAA GAA CGT TCG TGG TTC CCT GAG CCA GAG ATT TAT CAA ACT 829 
Ser Art; Ciu Glu Arg Ser Trp Phe Arg Glu Ala Glu XI* Tyr Gin Thr 
240 24S 250 

GTA ATG TTA CGT CAT GAA AAC ATC CTG GGA TTT ATA CCA CCA GAC AAT 877 
Val Met Leu Arg Hie Glu Aen lie Leu Gly Phe He Ale Ale Asp Aen 
255 260 265 

AAA GAC AAT CGT ACT TGG ACT CAG CTC TGG TTG GTG TCA GAT TAT CAT 925 
Lye Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Vel Ser Aep Tyr Hie 
270 275 280 

GAG CAT GGA TCC CTT TTT GAT TAC TTA AAC AGA TAC AGA CTT ACT CTG 973 
Glu His Gly Ser Leu Phm Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val 
285 290 295 

GAA GGA ATG ATA AAA CTT GCT CTG TCC ACG GCG AGC GGT CTT GCC CAT 1021 
Glu Gly Met He Lye Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala Hie 
300 305 310 315 

CTT CAC ATG GAG ATT CTT GGT ACC CAA GGA AAG CCA GCC ATT GCT CAT 1069 
Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro Ala He Ala His 
320 325 330 

AGA GAT TTC AAA TCA AAG AAT ATC TTG GTA AAG AAG AAT GGA ACT TCC 1117 
Arg Asp Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Gly Thr Cys 
335 340 345 

TCT ATT CCA CAC TTA GGA CTG CCA GTA AGA CAT GAT TCA GCC ACA GAT 1165 
Cys He Ala Asp Leu Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp 
350 355 360 

ACC ATT CAT ATT CCT CCA AAC CAC ACA GTG GGA ACA AAA AGG TAC ATG 1213 
Thr He Asp He Ala Pro Asn His Arg Val Gly Thr Lys Arg Tyr Met 
365 370 375 

GCC CCT GAA GTT CTC CAT CAT TCC ATA AAT ATC AAA CAT TTT CAA TCC 1261 
Ala Pro Glu Val Leu Asp Asp Ser He Asn Met Lys His Phe Glu Ser 
380 385 390 395 

TTC AAA CCT CCT GAC ATC TAT CCA ATG GCC TTA GTA TTC TCG CAA ATT 1309 
Phe Lys Arg Ala Asp He Tyr Ala Met Gly Leu Val Phe Trp Glu He 
400 405 410 

CCT CGA CCA TCT TCC ATT GCT GGA ATT CAT GAA CAT TAC CAA CTC CCT 1357 
Ala Arg Arg Cys Ser He Gly Gly He His Glu Asp Tyr Gin Leu Pro 
415 420 425 

TAT TAT CAT CTT GTA CCT TCT GAC CCA TCA GTT CAA CAA ATG AGA AAA 1405 
Tyr Tyr Asp Leu Val Pro Ser Asp Pro Ser Val Glu Glu Met Arg Lys 
430 435 440 

GTT CTT TCT CAA CAG AAG TTA AGG CCA AAT ATC CCA AAC AGA TCG CAG 1453 
Val Val Cys Glu Gin Lys Leu Arg Pro Asn He Pro Asn Arg Trp Gin 
445 450 455 

AGC TCT CAA GCC TTC ACA GTA ATG GCT AAA ATT ATC ACA GAA TGT TCG 1501 
Ser Cys Glu Ala Leu Arg Val Met Ala Lys He Met Arg Glu Cys Trp 
460 465 470 475 
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TXT CCC AAT CCA CCA OCT ACC CTT ACA CCA TTO CCC ATT AAO AAA ACA 1549 
t£ *5 En el? Ala Ala Arg Uu Thr Ala Leu Arg II. Ly. Ly. Thr 
* 480 * 85 4SO 

TTA TCC CAA CTC ACT CAA CAC CAA CGC ATC AAA ATC TAATTCTACA 1595 
Leu Ser Gin Leu Ser. Cln Gin Clu Cly lie Lys Mat 
495 500 

CCTTTCCCTG AACTCTCCTT TTTTCTTCAC ATCTCCTCCT CGCTTTTAAT TTCCCACGTC 1655 

AGTTGTTCTA CCTCACTCAC ACCGAACAGA ACCATATTCC TTCCTTTTGC AGCAGTCTAA 1715 

TAAAGTCAAT TAAAAACTTC CCACGATTTC TTTGCACCCA GGAAACACCC ATCTGCCTCC 1775 

TTTCTGTCCA CTATCAACGC TTCTTTCCCA CGACAGAAAA TCTGTAGTCT ACCTTTATTT 183S 

TTTATTAACA AAACTTGTTT TTTAAAAAGA TGATTCCTCC TCTTAACTTT AGCTAACTCT 1895 

GCTGTGCTGG AGATCATCTT TAAGGGCAAA GGAGTTGGAT TGCTGAATTA CAATGAAACA 1955 

TGTCTTATTA CTAAAGAAAG TGATTTACTC CTGGTTAGTA CATTCTCAGA GGATTCTGAA 2015 

CCACTAGAGT TTCCTTGATT CAGACTTTGA ATGTACTGTT CTATAGTTTT TCAGGATCTT 2075 

AAAACTAACA CTTATAAAAC TCTTATCTTG AGTCTAAAAA TCACCTCATA TAGTAGTGAG 2135 

GAACATAATT CATCCAATTG TATTTTGTAT ACTATTATTG TTCTTTCACT TATTCACAAC 219S 

ATTACATGCC TTCAAAATGC CATTGTACTA TACCAGTAAG TCCCACTTCT GTCTCTTTCT 22SS 

AATGGAAATG AGTAGAATTG CTGAAAGTCT CTATGTTAAA ACCTATAGTG TTT 2308 

(2) INFORMATION POR SEQ ID NO: 10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Clu Ala Ala Val Ala Ala Pro Arg Pro Arg Leu Leu Leu Leu Val 

1 5 10 15 

Leu Ala Ala Ala Ala Ala Ala Ala Ala Ala Leu Leu Pro Gly Ala Thr 
20 25 30 

Ala Leu Gin Cya Phe Cyo His Leu Cys Thr Lya Asp Aan Phe Thr Cy« 
35 40 45 

Val Thr Asp Cly Leu Cye Phe Val Ser Val Thr Clu Thr Thr Asp Ly« 
50 55 60 

Val He His Asn Ser Met Cys He Ala Clu He Asp Leu He Pro Arg 
65 70 75 80 
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Asp Arg Pro Phe Val Cys Ala Pro Ser Ser Lys Thr Cly Ser Val Thr 

85 90 95 

Thr Thr Tyr Cys Cys Asn Gin Asp His Cys Asn Lys lis Clu Lau Pro 

100 105 110 

Thr Thr Val Lys Ser Ser Pro Gly Leu Gly Pro Val Clu Leu Ala Ala 
115 120 125 

Val lie Ala Gly Pro Val Cys Phe Val Cys He Ser Leu Met Leu Met 
130 135 140 

Val Tyr He Cys His Asn Arg Thr Val He His His Arg Val Pro Asn 
145 150 155 160 

Clu Glu Asp Pro Ser Leu Asp Arg Pro Phe He Ser Clu Cly Thr Thr 
165 170 175 

Leu Lys Asp Leu He Tyr Asp Met Thr Thr Ser Gly Ser Gly Ser Gly 
180 185 190 

Leu Pro Leu Leu Val Gin Arg Thr He Ala Arg Thr He Val Leu Gin 
195 200 205 

Glu Ser He Gly Lys Cly Arg Phe Gly Glu Val Trp Arg Gly Lys Trp 
210 215 220 

Arg Gly Glu Glu Val Ala Val Lys He Phe Ser Ser Arg Glu Glu Arg 
225 230 235 240 

Ser Trp Phe Arg Glu Ala Clu He Tyr Gin Thr Val Met Leu Arg His 
245 250 255 

Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Asp Asn Gly Thr 
260 265 270 

Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His Glu His Gly Ser Leu 
275 280 285 

Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr Val Glu Cly Met He Lys 
290 295 300 

Leu Ala Leu Ser Thr Ala Ser Gly Leu Ala His Leu His Met Glu 21s 
305 310 315 320 

Val Gly Thr Gin Gly Lys Pro Ala lis Ala His Arg Asp Leu Lys Ser 
325 330 335 

Lys Asn He Leu Val Lys Lys Asn Gly Thr Cys Cys He Ala Asp Leu 
340 345 350 

Gly Leu Ala Val Arg His Asp Ser Ala Thr Asp Thr He Asp He Ala 
355 360 365 

Pro Aen His Arg Val Gly Thr Lys Arg Tyr Met Ala Pro Clu Val Leu 
370 375 380 

Asp Asp Ser He Asn Met Lys His Phe Glu Ser Phe Lys Arg Ala Asp 
385 390 395 400 
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II. Tyr AU Met Cly Leu Val Phe Trp Clu lie Ala Arg Arg Cya Ser 
405 410 415 

He Civ Cly lie Hit Clu Amp Tyr Cln Leu Pro Tyr Tyr Asp Leu Val 
7 1 420 425 430 

Pro ser Asp Pro Ser Val Clu Clu Mot Arg Lye Val Val Cya Clu Cln 
435 440 445 

Lva Leu Arg Pro Aan Ila Pro Aan Arg Trp Cln Sar Cya Clu Ala Lau 
1 450 455 460 

Xro Val Met Ala Lya He Met Arg Clu Cya Trp Tyr Ala Aan Cly Ala 
465 470 475 480 

Ala Arg Leu Thr Ala Leu Arg He Lya Lya Thr Leu Ser Cln Leu Ser 
485 490 495 

Cln Gin Clu Cly He Lya Met 
500 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1922 baae paira 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS : unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouae 

(ix) FEATURE: 

(A) NAME /KEY: COS 

(B) LOCATION: 241.. 1746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: lis 

CAGACCACAC CCCTTCCCAC TCCCCGCAGC CCCCCCCCCA CCCGCGCATC ATCAAGACCT 60 

TTTCCCCGCC CCCACACGCC CTCTCCACCT GAGACCCCCC CCCCCTCCCC AAGCACACCC 120 

CGGGGTCGAC TCCCCCTCTC CAAAGCCCTC AATCTAAACA ATCTTCATTC CTCTTCCCCG 180 

CTGGCGCCAC CCTCAATCCC AGCAAATCTC ACCACATCTC TTCTCCTATC TCCAAGCACC 240 

ATG ACC TTG CCC ACC TTC ACA AGC GCC CTT TTC ATC CTC TCC CTC CCC 288 
Met Thr Leu Cly Ser Phe Arg Arg Cly Leu Leu Met Leu Ser Val Ala 
1 5 10 15 
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TTC CCC CTA ACC CAG CCC ACA CTT GCG AAO CCT TCC AAC CTC CTG AAC 336 
Leu Cly Leu Thr Cln Cly Arg Leu Ala Lys Pro Ser Lye Leu Val Aen 
20 25 30 

TCC ACT TCT CAC ACC CCA CAC TCC AAC ACA CCA TTC TCC GAG CCC TCA 384 
Cye Thr Cye Clu Ser Pro Hie Cye Lye Arg Pro Phe Cye Cln Cly Ser 
35 40 45 

TCC TCC ACA CTC CTC CTC CTT CCA GAG CAC CCC AGG CAC CCC CAG CTC 432 
Trp Cys Thr Val Val Leu Val Arg Glu Gin Gly Arg Hie Pro Gin Val 
50 55 60 

TAT CGC GCC TGT GCG ACC CTC AAC CAC GAG CTC TCC TTC CCA CCT CCC 480 
Tvr Arg Cly Cye Gly Ser Leu Aen Gin Glu Leu Cye Leu Gly Arg Pro 
65 70 75- 80 

ACC CAG TTT CTG AAC CAT CAC TCC TGC TAT AGA TCC TTC TGC AAC CAC 528 
Thr Clu Phe Leu Aen Hie Hio Cye Cye Tyr Arg Ser Phe Cye Aen Hie 
85 90 95 

AAC CTG TCT CTG ATG CTC CAG CCC ACC CAA ACT CCT TCG GAG GAG CCA 576 
Aen Val Ser Leu Met Leu Clu Ala Thr Gin Thr Pro Ser Glu Glu Pro 
100 105 110 

CAA CTT GAT GCC CAT CTG CCT CTC ATC CTC CGT CCT GTC CTC CCC TTC 624 
Glu Val Asp Ala Hie Leu Pro Leu lie Leu Cly Pro Val Leu Ala Leu 
115 120 125 

CCC CTC CTC CTC CCC CTC CCT CCT CTC CGC TTC TCC CCT CTC CGG CGC 672 
Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg 
130 135 140 

AGG CAC CAG AAG CAC CGG GAT TTC CAC ACT CAC CTC CGC CAG TCC AGT 720 
Arg Gin Glu Lye Gin Arg Asp Leu Hie Ser Aep Leu Gly Clu Ser Ser 
145 150 155 160 

CTC ATC CTG AAG CCA TCT CAA CAG CCA CAC ACC ATC TTC CGC CAC TTC 768 
Leu lie Leu Lye Ala Ser Glu Gin Ala Aep Ser Met Leu Gly Aep Phe 
165 170 175 

CTG GAC AGC CAC TCT ACC ACC GCC ACC GGC TCC GGG CTC CCC TTC TTG 816 
Leu Aap Ser Asp Cye Thr Thr Cly Ser Cly Ser Cly Leu Pro Phe Leu 
180 185 190 

CTG CAC ACC ACC CTA CCT CGG CAC CTT CCC CTC CTA CAC TCT CTG CCA 864 
Val Gin Arg Thr Val Ala Arg Cln Val Ala Leu Val Glu Cye Val Gly 
195 200 205 

AAG GCC CCA TAT GGC CAG GTC TCG CGC CCT TCG TGG CAT GCC CAA AGC 912 
Lvs Gly Arg Tyr Gly Glu Val Trp Arg Gly Ser Trp Hie Cly Glu Ser 
210 215 220 

CTG GCC CTC AAG ATT TTC TCC TCA CCA CAT CAG CAG TCC TCG TTC CGG 960 
Val Ala Val Lye lie Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe Arg 
225 230 235 240 

CAC ACG CAC ATC TAC AAC ACA CTT CTC CTT ACA CAC CAC AAC ATC CTA 1008 
Clu Thr Clu He Tyr Aen Thr Val Leu Leu Arg His Asp Aen He Leu 
245 250 255 
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OCC TTC ATC CCC TCC CAC ATG ACT TCO COO AAC TCC AGC ACC CAG CTC 10S6 
el? "S m Ala S.r A.p M.t Thr S.r Arg A.n S.r s.r Thr Oln Leu 

TCC CTC ATC ACC CAC TAC CAT CAA CAC CCC TCC CTC TAT CAC TTT CTC 1104 
™ Su ttj i£ K. Tyr Hi. 01. Hi. Cly Ser Leu Tyr A.p Ph. Leu 

CAO AGO CAC ACO CTC GAG CCC CAG TTC CCC CTC AGO CTA OCT CTC TCC 
Arg Gin Thr Leu Glu Pro Gin Leu Ala Leu Arg Leu Ala Val Ser 
290 295 30 

rcc. ccc TGC GGC CTC GCG CAC CTA CAT GTG CAG ATC TTT CCC ACT CAA 
52 Ala ™ g15 Su Al. Hi. Leu Hi. VI Clu XI. Ph. Cly Thr Oln 



30S 



CCC AAA CCA CCC ATT OCC CAT COT OAC CTC AAO AGT CCC AAT CTO CTC 
Gly Ly. Pro Ala iS Al. Hi. Arg A.p Leu Ly. Ser Arg A.n Val Leu 



32S 



CTC AAG ACT AAC TTC CAO TCT TCC ATT CCA CAC CTC GGA CTC CCT GTG 
51? l£ Mr A.S ™ Gin Cy. Cy. XI. Ala A.p Leu Cly Leu Al. Val 

- 345 Jau 



340 



xtc CAC TCA CAA AGC AAC GAG TAC CTC CAT ATC GGC AAC ACA CCC CCA 
Me? S. £5 Ser A.n Glu Tyr Leu Asp He Gly Ksn Thr Pro Arg 

360 365 



355 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



r-rc CCT ACC AAA AGA TAC ATG GCA CCC CAG GTG CTC GAT GAG CAC ATC 
52 i£ Tyr Met Al. Pro Clu Val Leu A.p Glu Hi. He 
370 375 380 

CGC ACA GAC TCC TTT GAG TCO TAC AAG TGC ACA CAC ATC TOO CCC TTT 
Sg Tte Alp Cy. Ph. Glu Ser Tyr Ly. Trp Thr A.p XI. Trp Al. Ph. 
385 390 395 

GGC CTA CTC CTA TGC GAG ATC CCC CCC CCC ACC ATC ATC AAT CCC ATT 1488 
CW Leu Sal Leu Trp Glu II. Al. Arg Arg Thr Ilo lie A.n Gly II. 
J 405 410 415 



GTG GAG CAT TAC AGO CCA CCT TTC TAT GAC ATG GTA CCC AAT GAC CCC 1536 
Val Glu Asp Tyr Arg Pro Pro Phe Tyr Asp Met Val Pro A.n A.p Pro 
420 *25 

ACT TTT CAG GAC ATG AAA AAC CTC CTC TGC GTT CAC CAG CAC ACA CCC 1584 
Ser Phe Glu A.p Met Ly. Ly. Val Val Cy. Val A.p Gin Gin Thr Pro 
435 440 445 

ACC ATC CCT AAC CGC CTC OCT GCA CAT CCC GTC CTC TCC GGC CTO CCC 1632 
iSr ill ?Vo JUS Lg Leu Al. Ala A.p Pro Val Leu Ser Gly Leu Ala 
450 455 460 

CAO ATC ATC ACA CAO TCC TCC TAC CCC AAC CCC TCT OCT CCC CTC ACC 1680 
£n Met Met Arg Glu Cy. Trp Tyr Pro A.n Pro Ser Ala Arg Leu Thr 
465 «70 475 

CCA CTC CGC ATA AAG AAG ACA TTC CAG AAC CTC ACT CAC AAT CCA CAO 1728 
Ala Leu Arg lie Ly. Lv. Thr Leu Cln Ly. Leu Ser Hi. A.n Pro Clu 
485 490 * ss 



WO 94/11502 . PCT/GB93/02367 

63 

AAC CCC AAA CTG ATT CAC TACCCCACCC CCACCACCCT TCCTCTCCCT 1776 
Lys Pro Lys Val lie His 
500 

AAAGTCTGTC CTGGGGAAGA ACACATAGCC TCTCTGCGTA GAGGGACTCA AGAGAGTGTG 1836 

CACGCTGCCC TCTCTGTGCC TGCTCAGCTT GCTCCCAGCC CATCGAGCCA AAAATACAGC 1896 

TGAGCTGAAA TTCAAAAAAA AAAAAA 1922 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: anino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Thr Leu Gly Ser Phe Arg Arg Gly Leu* Leu Met Leu Ser Val Ala 
1 5 10 15 

Leu Gly Leu Thr Gin Gly Arg Leu Ala Lys Pro Ser Lys Leu Val Aon 
20 25 30 

Cys Thr Cys Glu Ser Pro His Cys Lys Arg Pro Phe Cys Gin Gly Ser 
35 40 45 

Trp Cys Thr Val Val Leu Val Arg Glu Gin Gly Arg His Pro Gin Val 
50 55 60 

Tyr Arg Gly Cys Gly Ser Leu Asn Gin Glu Leu Cys Leu Gly Arg Pro 
65 70 75 80 

Thr Glu Phe Leu Asn His His Cys Cys Tyr Arg Ser Phe Cys Asn His 
85 90 95 

Asn Val Ser Leu Met Leu Glu Ala Thr Gin Thr Pro Ser Glu Glu Pro 
100 105 110 

Glu Val Asp Ala His Leu Pro Leu lie Leu Gly Pro Val Leu Ala Leu 
115 120 125 

Pro Val Leu Val Ala Leu Gly Ala Leu Gly Leu Trp Arg Val Arg Arg 
13C 135 140 

Arg Gin Glu Lys Gin Arg Asp Leu His Ser Asp Leu Gly Glu Ser Ser 
145 150 155 160 

Leu lie Leu Lys Ala Ser Glu Gin Ala Asp Ser Met Leu Gly Asp Phe 
165 170 175 

Leu Asp Ser Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu 
180 185 190 
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Val Cln Arg Thr Val Ala Arg Ola V4l Ala Leu Val Clu Cy. Val Cly 

J95 200 *os 

Ly. Cly Arg Tyr Cly Clu val Trp Arg Cly Ser Trp Hi* Cly Clu Ser 

val Ala Val Ly. Ila Phe Ser Ser Arg A.p Clu Cln Ser Trp Phe Arg 
225 230 2 

Clu Thr Clu Ila Tyr Acn Thr Val Lou Leu Arg Hia Aap Aan II. Lau 
245 250 * 3 » 

Gly Pha Ila Ala Ser Aap Met Thr Ser Arg Asn Ser Ser Thr Cln Leu 

* 260 265 2 

Trp Leu lie Thr Hit Tyr Hia Clu Hia Cly Sar Leu Tyr Aap Pha Leu 

275 280 
Cln Arg Cln Thr Leu Clu Pro Cln Leu Ala Lau Arg Leu Ala Val Ser 
290 2 *5 300 

Pro Ala cy. Cly Leu Ala Hia Leu Hia Val Clu Ila Phe Cly Thr Cln 

305 310 315 320 

Cly Lya Pro Ala Ila Ala Hia Arg Asp Leu Ly. Ser Arg Aan Val Leu 

325 330 JJ3 

Val Ly. Ser A.n Lau Cln Cya Cy. lie Ala A.p Leu Cly Leu Ala Val 

Met Hia Ser Gin ser Asn Clu Tyr Leu A.p He Gly hen Thr Pro Arg 
355 360 365 

Val Cly Thr Lya Arg Tyr Met Ala Pro Clu Val Leu A.p Clu Hia He 
370 375 380 

Arg Thr A.p Cya Pha Clu Ser Tyr Lya Trp Thr Aap 21a Trp Ala Phe 

385 390 395 

Gly Leu Val Leu Trp Clu He Ala Arg Arg Thr He He Aan Cly He 

* 405 410 415 

Val Clu Asp Tyr Arg Pro Pro Ph. Tyr Asp Met Val Pro Aan A.p Pro 
420 *25 «30 

Ser Phe Clu Aap Met Ly. Lya Val Val Cy. Val A.p Gin Gin Thr Pro 
435 440 445 

Thr He Pro A.n Arg Leu Ala Ala A.p Pro Val Leu Ser Gly Leu Ala 
450 4S5 460 

Cln Met Met Arg Clu Cya Trp Tyr Pro Aan Pro Ser Ala Arg Leu Thr 
465 470 475 480 

Ala Leu Arg He Lya Lya Thr Leu Cln Ly. Leu Ser Hi. Aan Pro Clu 
48S 490 495 

Lya Pro Lya Val He Hia 

500 
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(2) INFORMATION FOR SEQ ID HO: 13 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2070 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 217.. 1812 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

ATTCATGAGA TGGAAGCATA GGTCAAAGCT GTTCGGAGAA ATTCGAACTA CAGTTTTATC 60 

TAGCCACATC TCTCAGAATT CTGAAGAAAG CAGCAGGTCA AACTCATTGC CAACTGATTT 120 

TGTTCTCTAA GGAAGCCTCC CTCATTCACT TACACCAGTG AGACACCAGG ACCAGTCATT 180 

CAAAGGGCCG TCTACAGCAC GCCTGGCAAT CAGACA ATG ACT CAG CTA TAC ACT 234 

Met Thr Gin Leu Tyr Thr 
1 5 

TAC ATC AC A TTA CTC GGA CCC TGT CTC TTC ATC ATT TCT CAT CTT CAA 282 
Tyr He Arg Leu Leu Gly Ala Cys Leu Phe He He Ser His Val Gin 
10 15 20 

GGG CAG AAT CTA GAT AGT ATC CTC CAT GGC ACT GGT ATG AAA TCA GAC 330 
Gly Gin Asn Leu Asp Ser Met Leu His Gly Thr Gly Met Lys Ser Asp 
25 30 35 

TTC GAC CAG AAG AAG CCA GAA AAT GGA GTG ACT TTA GCA CCA CAG GAT 378 
Leu Asp Gin Lys Lys Pro Glu Asn Gly Val Thr Leu Ala Pro Clu Asp 
40 45 50 

ACC TTG CCT TTC TTA AAG TCC TAT TGC TCA GGA CAC TGC CCA GAT GAT 426 
Thr Leu Pro Phe Leu Lys Cys Tyr Cys Ser Gly His Cys Pro Asp Asp 
55 60 65 70 

CCT ATT AAT AAC ACA TGC ATA ACT AAT GGC CAT TCC TTT CCC ATT ATA 474 
Ala He Asn Asn Thr Cys He Thr Asn Gly His Cys Phe Ala He He 
75 80 85 

GAA GAA GAT GAT CAG GGA CAA ACC ACA TTA ACT TCT GGG TGT ATG AAG 522 
Glu Clu Asp Asp Gin Gly Clu Thr Thr Leu Thr Ser Gly Cys Met Lys 
90 95 100 
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TAT GAA CCC TCT CAT TTT CAA TCC AAO CAT TCA CCC AAA CCC CAO CTA 570 
Tyr Glu Cly Ser Asp Ph. Gin Cy. Ly. A.p S.r Pro Ly. Ala Gin L«u 

105 HO * 

CCC AGG ACA ATA GAA TCT TCT CGG ACC AAT TTC TGC AAC CAG TAT TTC 618 
Sg irt Thr 111 Glu Cy. Cy. Arg Thr Asn Leu Cy. A.r. Gin Tyr L.u 
9 120 "5 130 

CAG CCT ACA CTC CCC CCT CTT CTT ATA GOT CCG TTC TTT CAT CCC AGC 666 
Gin Pro Thr Leu Pro Pro Val Val 11. Gly Pro Ph. Ph. Asp Cly Ser 
13S 140' 1*5 150 



ATC CCA TGG CTC GTT CTC CTC ATT TCC ATG OCT CTC TGT ATA GTT CCT 714 
J5 £g Trp Leu Val Val Leu lie Ser Met Ala Val Cy. lie Val Ala 
155 160 165 

ATG ATC ATC TTC TCC AGC TGC TTT TCC TAT AAG CAT TAT TGT AAG AGT 762 
Met He He Ph. S.r Ser Cys Phe Cy. Tyr Ly. Hi. Tyr Cys Ly. Ser 
170 175 180 

ATC TCA AGC AGG GGT CGT TAC AAC CCT CAT TTC CAA CAG CAT GAA GCA 810 
He Ser Ser Arg Cly Arg Tyr Asn Arg Asp Leu Glu Gin Asp Glu Ala 
185 190 195 

TTT ATT CCA GTA GCA GAA TCA TTC AAA CAC CTC ATT GAC CAG TCC CAA 858 
Phe He Pro Val Cly Glu Ser Leu Ly. A.p L.u He Asp Gin ser Gin 
200 205 210 

AGC TCT GGC AGT GCA TCT GCA TTG CCT TTA TTC GTT CAG CGA ACT ATT 906 
Ser ser Gly Ser Gly Ser Gly Leu Pro Leu Leu Val Gin Arg Thr lie 
215 220 225 230 

CCC AAA CAG ATT CAG ATG GTT CCC CAC CTT CGT AAA CCC CCC TAT GCA 954 
Ala Ly. Gin II. Gin Met Val Arg Gin Val Gly Ly. Gly Arg Tyr Gly 
235 240 245 

GAA GTA TGC ATG GGT AAA TGG CGT CCT CAA AAA GTG OCT GTC AAA GTG 1002 
Glu Val Trp Met Cly Lye Trp Arg Gly Glu Lys Val Ala Val Lys Val 
250 255 260 

TTT TTT ACC ACT CAA GAA OCT AGC TCC TTT ACA GAA ACA GAA ATC TAC 
Phe Phe Thr Thr Glu Glu Ala Ser Trp Phe Arg Glu Thr Glu He Tyr 
265 270 275 

CAG ACG GTG TTA ATC CGT CAT GAA AAT ATA CTT CGT TTT ATA OCT GCA 
Gin Thr Val Leu Met Arg Hi. Glu Asn II. Leu Gly Phe He Ala Ala 
280 285 290 

GAC ATT AAA GGC ACT GGT TCC TGC ACT CAG CTG TAT TTG ATT ACT GAT 1146 
Asp He Lys Gly Thr Gly Ser Trp Thr Gin L.u Tyr Leu He Thr Asp 
295 300 305 310 

TAC CAT CAA AAT GCA TCT CTC TAT GAC TTC CTG AAA TGT CCC ACA CTA 1194 
Tyr His Glu Asn Gly Ser Leu Tyr Asp Phe Leu Lys Cys Ala Thr Leu 
315 320 325 

GAC ACC AGA GCC CTA CTC AAG TTA CCT TAT TCT GCT OCT TCT CGT CTG 1242 
Asp Thr Arg Ala Leu Leu Lys Leu Ala Tyr ser Ala Ala Cy. Gly L.u 
330 335 340 



1050 



1098 
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TCC CAC CTC CAC ACA CAA ATT TAT OCT ACC CAA CCC AAG CCT CCA ATT 1290 
Cys His Leu His Thr Glu lis Tyr Gly Thr Gin Gly Lys Pro Ala lis 
34S 350 355 

CCT CAT CGA GAC CTG AAG AGC AAA AAC ATC CTT ATT AAG AAA AAT GGA * 1338 
Ala His Arg Asp Leu Lys Ser Lys Asn He Leu lie Lys Lys Asn Gly 
360 365 370 

AGT TCC TGT ATT CCT CAC CTG GGC CTA CCT GTT AAA TTC AAC ACT GAT 1386 
Ser Cys Cys lie Ala Asp Leu Gly Leu Ala Val Lys Phe Asn Ser Asp 
375 380 385 390 

ACA AAT CAA GTT CAC ATA CCC TTG AAT ACC AGG GTG GGC ACC AAG CGC 1434 
Thr Asn Glu Val Asp lie Pro Leu Asn Thr Arg Val Gly Thr Lys Arg 
395 400 405 

TAG ATG CCT CCA CAA GTG CTG GAT GAA AGC CTG AAT AAA AAC CAT TTC 1482 
Tyr Met Ala Pro Glu Val Leu Asp Glu Ser Leu Asn Lys Asn His Phe 
410 415 420 

CAG CCC TAC ATC ATG CCT CAC ATC TAT AGC TTT GGT TTG ATC ATT TCC 1530 
Gin Pro Tyr lie Met Ala Asp lie Tyr Ser Phe Gly Leu He He Trp 
425 430 435 

CAA ATC CCT CCT CCT TGT ATT ACA CCA CCA ATC CTC CAC CAA TAT CAA 1578 
Clu Met Ala Arg Arg Cys He Thr Gly Gly He Val Clu Glu Tyr Gin 
440 445 450 

TTA CCA TAT TAC AAC ATC CTG CCC ACT CAC CCA TCC TAT CAG CAC ATC 1626 
Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp Pro Ser Tyr Clu Asp Met 
455 460 465 470 

CCT CAC CTT GTG TGT CTC AAA CCC TTC CGC CCA ATC CTC TCT AAC CCC 1674 
Arg Glu Val Val Cys Val Lys Arg Leu Arg Pro He Val Ser Asn Arg 
475 480 485 

TCC AAC AGC GAT GAA TCT CTT CGA CCA CTT TTC AAG CTA ATG TCA CAA 1722 
Trp Asn Ser Asp Glu Cys Leu Arg Ala Val Leu Lys Leu Met Ser Glu 
490 495 500 

TCT TCC CCC CAT AAT CCA GCC TCC AGA CTC ACA CCT TTG ACA ATC AAC 1770 
Cys Trp Ala His Asn Pro Ala Ser Arg Leu Thr Ala Leu Arg He Lys 
505 510 515 

AAC ACA CTT GCA AAA ATG GTT GAA TCC CAG GAT GTA AAG ATT 1812 
Lys Thr Leu Ala Lys Met Val Glu Ser Gin Asp Val Lys He 
520 525 530 



TGACAATTAA 


ACAATTTTCA 


CCGAGAATTT 


AGACTCCAAG 


AACTTCTTCA 


CCCAAGCAAT 


1872 


GCGTCCCATT 


AGCATGCAAT 


AGGATGTTGA 


CTTCGTTTCC 


AGACTCCTTC 


CTCTACATCT 


1932 


TCACACCCTG 


CTAACACTAA 


ACCTTACCGT 


ACTCTACAGA 


ATACAAGATT 


GGAACTTGGA 


1992 


ACTTCAAACA 


TGTCATTCTT 


TATATATCAC 


ACCTTTCTTT 


TAATGTGGGG 


TTTTTTTGTT 


2052 



TCCTTTTTTT CX'TTTGTT 



2070 
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(2) INFORMATION FOR SEQ ID HO: 14 1 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTHt 532 amino acids 

(B) TYPEt amino acid 
<D) TOPOLOGY » linear 

(it). MOLECULE TYPE: protatn 

(xl) SEQUENCE DESCRIPTION t SEQ ID NO» 14* 

Met Thr Gin Leu Tyr Thr Tyr XI* Arg Leu Lau Oly Ala Cya Leu Pha 

1 5 10 15 

He He Ser Hia Val Cln Cly Gin Aan Leu Asp Ser Met Leu Bia Gly 
20 25 30 

Thr Cly Met Lya Ser Aap Leu Aap Cln Lya Lya Pro Clu Aan Gly Val 
35 40 45 

Thr Leu Ala Pro Glu Aap Thr Leu Pro Phe Leu Lya Cye Tyr Cya Ser 

50 55 60 

Cly Hia Cya Pro Aap Aap Ala He Aan Aan Thr Cys He Thr Aan Gly 
65 70 75 80 

Hia Cya Phe Ala He He Glu Glu Aap Aap Gin Gly Glu Thr Thr Leu 
85 90 95 

Thr Ser Gly Cya Met Lya Tyr Glu Gly Ser Aap Phe Gin Cya Lya Aap 
100 105 HO 

Ser Pro Lya Ala Gin Leu Arg Arg Thr He Glu Cya Cya Arg Thr Aan 
115 120 125 

Leu Cya Aan Gin Tyr Leu Gin Pro Thr Leu Pro Pro Val Val He Gly 
130 135 1*0 

Pro Phe Phe Aep Gly Ser He Arg Trp Leu Val Val Leu He Ser Met 
145 150 155 160 

Ala Val Cys He Val Ala Met He He Phe Ser Ser Cya Phe Cya Tyr 
165 170 175 

Lya Hia Tyr Cya Lya Ser He Ser Ser Arg Gly Arg Tyr Aan Arg Aap 

1 180 185 190 

Leu Clu Gin Aap Glu Ala Phe He Pro Val Gly Glu Ser Leu Lya Aep 
195 200 205 

Leu He Aap Gin Ser Gin Ser Ser Gly Ser Gly Ser Gly Leu Pro Leu 
210 215 220 

Leu Val Gin Arg Thr He Ala Lya Gin lie Gin Met Val Arg Gin Val 
225 230 235 240 

Gly Lya Gly Arg Tyr Gly Glu Val Trp Met Cly Lya Trp Arg Gly Glu 
245 250 255 



. i-rr- CUCPT 
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Lys Val Ala Val Lys Val Phe Phe Thr Thr Clu Clu Ala Ser Trp Phe 

260 265 270 

Aro Clu Thr Glu lie Tyr Gin Thr Val Leu Met Arg Hit Clu Asn lie 
275 280 285 

Leu Cly Phe lie Ala Ala Asp He Lye Cly Thr Cly Ser Trp Thr Gin 
290 295 300 

Leu Tyr Leu He Thr Asp Tyr Hie Clu Aen Cly Ser Leu Tyr Asp Phe 
305 310 315 320 

Leu Lye Cye Ala Thr Leu Asp Thr Arg Ala Leu Leu Lye Leu Ala Tyr 

325 330 335 

Ser Ala Ala Cye Cly Leu Cys His Leu His Thr Glu Zle Tyr Gly Thr 
340 345 350 

Gin Gly Lys Pro Ala He Ala His Arg Asp Leu Lys Ser Lys Asn He 
355 360 365 

Leu He Lys Lys Asn Gly Ser Cys Cys He Ala Asp Leu Cly Leu Ala 

370 375 380 

Val Lys Phe Asn Ser Asp Thr Asn Glu Val Asp He Pro Leu Asn Thr 
385 390 395 400 

Arg Val Cly Thr Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Ser 
405 410 415 

Leu Asn Lys Asn His Phe Gin Pro Tyr He Met Ala Asp He Tyr Ser 
420 425 430 

Phe Cly Leu He He Trp Glu Met Ala Arg Arg Cys He Thr Gly Cly 
435 440 445 

He Val Glu Glu Tyr Gin Leu Pro Tyr Tyr Asn Met Val Pro Ser Asp 
450 455 460 

Pro Ser Tyr Glu Asp Met Arg Glu Val Val Cys Val Lys Arg Leu Arg 
465 470 475 480 

Pro He Val Ser Asn Arg Trp Asn Ser Asp Glu Cys Leu Arg Ala Val 
485 490 495 

Leu Lys Leu Met Ser Glu Cys Trp Ala His Asn Pro Ala Ser Arg Leu 
500 505 510 

Thr Ala Leu Arg He Lys Lys Thr Leu Ala Lys Met Val Glu Ser Gin 
515 520 525 

Asp Val Lys Zle 

530 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 
(0) TOPOLOGY s linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE I NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouee 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10. .1524 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCCGGTTAC ATG GCG GAG TCG GCC CGA GCC TCC TCC TTC TTC CCC CTT 48 
Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Leu 
15 10 

GTT GTC CTC CTG CTC GCC GGC AGO GCC GGG TCC GGG CCC CGG GCG ATC 
Val Val Leu Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly He 
15 20 25 



96 



CAC GCT CTC CTC TOT GCG TGC ACC AGC TGC CTA CAG ACC AAC TAC ACC 144 
Gin Ala Leu Leu Cys Ala Cye Thr Ser Cys Leu Gin Thr Asn Tyr Thr 
30 35 40 45 

TCT GAG ACA CAT CGG GCT TGC ATG CTC TCC ATC TTT AAC CTG GAT GGC 192 
Cys Glu Thr Asp Cly Ala Cye Met Val Ser He Phe Aan Leu Aep Gly 
50 5S 60 

CTG CAG CAC CAT GTA COT ACC TCC ATC CCC AAC CTC CAC CTG CTT CCT 240 
Val Glu Hie His Val Arg Thr Cye He Pro Lye Val Glu Leu Val Pro 
65 70 75 

GCT GGA AAG CCC TTC TAC TGC CTG ACT TCA GAG GAT CTG CGC AAC ACA 288 
Ala Gly Lye Pro Phe Tyr Cye Leu Ser Ser Glu Asp Leu Arg Asn Thr 
80 85 90 

CAC TCC TCC TAT ATT CAC TTC TCC AAC AAG ATT CAC CTC AGG GTC CCC 336 
His Cye Cys Tyr He Asp Phe Cys Asn Lye He Asp Leu Arg Val Pro 
95 100 105 

AGC CGA CAC CTC AAG GAG CCT GCG CAC CCC TCC ATG TCG GGC CCT CTG 384 
Ser Gly His Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val 
HO 115 120 125 

CAG CTG GTC GGC ATC ATC GCC GGC CCC GTC TTC CTC CTC TTC CTT ATC 432 
Glu Leu Val Gly He He Ala Gly Pro Val Phe Leu Leu Phe Leu He 
130 135 140 
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ATT ATC ATC CTC TTC CTC CTC ATC AAC TAT CAC CAC CCT CTC TAC CAT 480 

lift lie He Val Phe Leu Val lie Asn Tyr His Gin Arg Val Tyr His 

145 150 155 

AAC CCC CAC AGO TTC GAC ATG GAG GAC CCC TCT TCC GAG ATC TGT CTC 528 

Ann Arg Gin Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu 
160 165 170 

TCC AAA GAC AAS ACG CTC CAG GAT CTC CTC TAC GAC CTC TCC ACG TCA 576 

Ser Lys Asp Lys Thr Leu Gin Asp Leu Val Tyr Asp Leu Ser Thr Ser 
175 180 185 

GGC TCT GGC TCA CGG TTA CCC CTT TTT GTC CAG CGC ACA GTG GCC CGA 624 

Gly Ser Gly Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg 
190 195 200 205 

ACC ATT GTT TTA CAA CAG ATT ATC GGC AAC GGC CGG TTC GCG GAA GTA 672 

Thr lie Val Leu Gin Glu He He Gly Lys Gly Arg Phe Gly Glu Val 
210 215 220 

TCC CGT GGT CCC TCG AGG GGT GGT GAC GTC CCT GTC AAA ATC TTC TCT 720 

Trp Arg Gly Arg Trp Arg Gly Gly Asp Val Ala Val Lys He Phe Ser 

225 230 235 

TCT CCT CAA CAA CGC TCT TCC TTC CGT GAA CCA GAC ATC TAC CAC ACC 768 

Ser Arg Glu Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin Thr 
240 245 250 

CTC ATG CTC CCC CAT CAA AAC ATC CTT GGC TTT ATT CCT CCT GAC AAT 816 

Val Met Leu Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp Asn 
255 260 265 

AAA CAT AAT GCC ACC TCC ACC CAC CTC TCG CTT GTC TCT GAC TAT CAC 864 

Lys Asp Asn Gly Thr Trp Thr Gin Leu Trp Leu Val Ser Asp Tyr His 
270 275 280 285 

GAG CAT CCC TCA CTG TTT GAT TAT CTC AAC CGC TAC ACA CTC ACC ATT 912 

Glu His Gly Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr He 
290 295 300 

GAG GGA ATC ATT AAG CTA GCC TTC TCT CCA CCC ACT CCT TTC CCA CAC 960 

Glu Gly Met He Lys Leu Ala Leu Ser Ala Ala Ser Cly Leu Ala His 

305 310 315 

CTC CAT ATC GAG ATT GTC CCC ACT CAA GGG AAG CCC GGA ATT CCT CAT 1008 

Leu His Met Glu He Val Gly Thr Gin Gly Lys Pro Gly He Ala His 
320 325 330 

CGA GAC TTC AAC TCA AAC AAC ATC CTC GTG AAA AAA AAT GGC ATG TGT 1056 

Arg Asp Leu Lys Ser Lys Asn He Leu Val Lys Lys Asn Cly Met Cys 
335 340 345 

GCC ATT CCA GAC CTG GGC CTG CCT CTC CGT CAT GAT GCG GTC ACT GAC 1104 

Ala He Ala Asp Leu Gly Leu Ala Val Arg His Asp Ala Val Thr Asp 
350 355 360 365 

ACC ATA GAC ATT CCT CCA AAT CAC AGG CTG GGG ACC AAA CGA TAC ATG 1152 

Thr He Asp He Ala Pro Asn Gin Arg Val Cly Thr Lys Arg Tyr Met 
370 375 380 
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CCT CCT GAA GTC CTT CAC CAG ACA ATC AAC ATG AAG CAC TTT CAC TCC 1200 

All Pro Glu Val Lau Asp Clu Thr lie Asn Net Lys His Phe Asp Ser 
Aia no w* ' 39Q 

TTC AAA TCT GCC GAC ATC TAT GCC CTC GGG CTT GTC TAC TOO GAG ATT 1248 
Ph. Lys Cys Ala Asp 11. Tyr Ala Lau Gly Leu Val Tyr Trp Glu Ut 
400 * os 410 

CCA CGA AGA TGC AAT TCT GCA GGA GTC CAT GAA GAC TAT CAA CTG CCG 1296 
AU SS Arg Cys Asn Sar Gly Gly V.l His Glu Asp Tyr Gin Leu Pro 
415 420 *25 

TAT TAC GAC TTA GTC CCC TCC GAC CCT TCC ATT CAG GAG ATO CGA AAG 1344 
Tyr Tyr Asp Leu Val Pro S.r Asp Pro S«r lis Glu Glu M.t Arg Lys 
430 435 4*0 445 

CTT GTA TGT GAC CAG AAG CTA CGC CCC AAT GTC CCC AAC TGG TOO CAG 1392 
Val Val Cys Asp Gin Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Gin 
450 4S5 46G 

ACT TAT GAG GCC TTG CGA CTG ATC GGA AAG ATG ATC COG CAO TGC TGG 1440 
Ser Tyr Clu Ala Leu Arg Val Mot Gly Lys Met Mat Arg Glu Cys Trp 
1 465 470 475 

TAC GCC AAT GGT OCT GCC CCT CTC ACA OCT CTG CGC ATC AAG AAG ACT 1488 
Tyr Ala Asn Gly Ala Ala Arg Leu Thr Ala Leu Arg lie Lys Lys Thr 
' 480 4 85 490 

CTC TCC CAG CTA AGC CTG CAG CAA CAT GTC AAG ATT TAAGCTGTTC 1S34 
Leu Ser Gin Leu Ser Val Gin Clu Asp Val Lys lie 
495 500 SOS 



CTCTCCCTAC ACAAAGAACC 


TGGGCAGTGA 


GGATGACTGC AGCCACCGTC 


CAAGCGTCGT 


1594 


GGACCCCTAT CCTCTTCTTT 


CTGCCCGGCC 


CTCTGGCAGA CCCCTGGCCT 


GCAAGAGGGA 


1654 


CAGACCCTGG CACACCOCCO 


CACTCCCGTT 


GGGTTTGACA CAGACACTTT 


TTATATTTAC 


1714 


CTCCTCATGC CATCGACACC 


TGAGCAAATC 


ATGTAGTCAC TCAATCCCAC 


AACTCAAACT 


1774 


G CTT CAG TGG GAAGTACACA 


GACCCAGTGC 


ATTGCGTGTG CAGGAGCGTG 


AGGTGCTGGG 


1834 


CTCGCCAGGA GCGGCCCCCA 


TACCTTGTGG 


TCCACTGGGC TGCAGCTTTT 


CCTCCAGGGA 


1894 


CCAGTCAACT GGCATCAAGA 


TATTGAGAGG 


AACCGGAAGT TTCTCCCTCC 


TTCCCGTAGC 


1954 


AGTCCTGAGC CACACCATCC 


TTCTCATGGA 


CATCCGGAGG ACTGCCCCTA 


GAGACACAAC 


2014 


CTGCTGCCTG TCTGTCCAGC 


CAAGTGCGCA 


TCTGCCCAGG TGTGTCCCAC 


ATTGTGCCTC 


2074 


GTCTGTCCCA CGCCCGTGTC 


TGTGTGTGTG 


TCTCTCAGTG AGTGTGTCTG 


TGTACACTTA 


2134 


ACCTGCTTGA CCTTCTGTGC 


ATGTGT 






2160 



(2) INFORMATION FOR SEQ ID NO* 16 1 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 505 amino acids 
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(B) TTPIi Amino acid 
(0) TOPOLOGY s linear 

(ii) KOLXCULE TYPE: protein 

(xi) sequzncb Description t seq id nos 16« 

Met Ala Glu Ser Ala Gly Ala Ser Ser Phe Phe Pro Lou Val Val Lou 

1 5 10 ^ IS 

Leu Leu Ala Gly Ser Gly Gly Ser Gly Pro Arg Gly lie Gin Ala Leu 
20 25 30 

Lou Cya Ala Cys Thr Ser Cys Leu Gin Thr Asn Tyr Thr Cya Glu Thr 
35 40 45 

Asp Gly Ala Cys Met Val Ser Zle Phe Asn Leu Asp Gly Val Glu His 

50 55 60 

His Val Arg Thr Cys He Pro Lys Val Glu Leu Val Pro Ala Gly Lys 
65 70 75 80 

Pro Phe Tyr Cys Leu Ser Ser Glu Asp Leu Arg Asn Thr His Cys Cys 
85 90 95 

Tyr He Asp Phe Cys Asn Lys He Asp Leu Arg Val Pro Ser Gly His 
100 105 110 

Leu Lys Glu Pro Ala His Pro Ser Met Trp Gly Pro Val Glu Leu Val 
115 120 125 

Gly He He Ala Gly Pro Val Phe Leu Leu Phe Leu He Zle He He 
130 135 140 

Val Phe Leu Val He Asn Tyr His Gin Arg Val Tyr His Asn Arg Gin 
14S 150 155 160 

Arg Leu Asp Met Glu Asp Pro Ser Cys Glu Met Cys Leu Ser Lys Asp 
165 170 175 

Lys Thr Leu Gin Asp Leu Val Tyr Asp Leu Ser Thr Ser Gly Ser Gly 
180 185 190 

Ser Gly Leu Pro Leu Phe Val Gin Arg Thr Val Ala Arg Thr He Val 
195 200 205 

Leu Gin Glu He He Gly Lys Gly Arg Phe Gly Glu Val Trp Arg Gly 
210 215 220 

Arg Trp Arg Gly Gly Asp Val Ala Val Lys He Phe Ser Ser Arg Glu 
225 230 235 240 

Glu Arg Ser Trp Phe Arg Glu Ala Glu He Tyr Gin Thr Val Met Leu 
245 250 255 

Arg His Glu Asn He Leu Gly Phe He Ala Ala Asp Asn Lys Asp Asn 
260 265 270 
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Cly Thr Trp Thr Cln Leu Trp Leu Val Ser Asp Tyr Hit Clu His Cly 
275 280 285 

Ser Leu Phe Asp Tyr Leu Asn Arg Tyr Thr Val Thr lis Clu Cly Met 

290 295 300 

He Lvs Lou Ala Leu Sar Ala Ala Sar Cly Leu Ala His Leu His Met 

3QS 310 ,315 320 

Clu lie Val Cly Thr Cln Cly Lys Pro Cly He Ala His Arg Asp Leu 

325 330 335 

Lvs Ser Lys Asn He Leu Val Lys Lys Asn Cly Met Cys Ala He Ala 

J 340 - 345 350 

Asp Leu Cly Leu Ala Val Arg His Asp Ala Val Thr Asp Thr He Asp 

355 360 365 

He Ala Pro Asn Cln Arg Val Cly Thr Lys Arg Tyr Met Ala Pro Clu 
370 375 380 

Val Leu Asp Clu Thr He Asn Met Lys His Phe Asp Ser Phe Lys Cys 
385 390 395 400 

Ala Asp He Tyr Ala Leu Cly Leu Val Tyr Trp Clu He Ala Arg Arg 
r 405 410 415 

Cys Asn Ser Cly Cly Val His Clu Asp Tyr Cln Leu Pro Tyr Tyr Asp 
1 420 425 430 

Leu Val Pro Ser Asp Pro Ser He Clu Clu Met Arg Lys Val Val Cys 
435 440 445 

Asp Cln Lys Leu Arg Pro Asn Val Pro Asn Trp Trp Cln Ser Tyr Clu 
450 455 460 

Ala Leu Arg Val Met Cly Lys Met Het Arg Clu Cys Trp Tyr Ala Asn 
465 470 475 480 

Cly Ala Ala Arg Leu Thr Ala Leu Arg He Lys Lys Thr Leu Ser Cln 
485 490 495 

Leu Ser Val Gin Glu Asp Val Lys He 
500 505 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1952 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY t unknown 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISMS Mouse 

(ix) FEATURE I 

(A) NAME /KEY: CDS 

(B) LOCATION: 187. .1692 



(xi) SEQUENCE DESCRIPTIONS SEQ ID NOs 17: 

AAGCGGCGCC AGAAGTTGCC GGCGTGGTGC TCGTAGTGAG GGCGCGGAGG ACCCGGGACC 60 

TGGGAAGCGG CGGCGGGTTA ACTTCGGCTG AATCACAACC ATTTGGCGCT CAGCTATGAC 120 

AAGAGAGCAA ACAAAAAGTT AAAGGAGCAA CCCGGCCATA AGTGAAGAGA GAAGTTTATT 180 

GATAAC ATC CTC TTA CGA AGC TCT GGA AAA TTA AAT GTG GGC ACC AAG 228 
Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys 
15 10 

AAG GAG CAT GGA GAG AGT ACA GCC CCC ACC CCT CGG CCC AAG ATC CTA 276 
Lys Glu Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys He Leu 
15 20 25 30 

CCT TGT AAA TGC CAC CAC CAC TCT CCC GAA CAC TCA GTC AAC AAT ATC 324 
Arg Cys Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn He 
35 40 45 

TGC AGC ACA CAT CGG TAC TGC TTC ACG ATC ATA GAA GAA GAT GAC TCT 372 
Cys Ser Thr Asp Gly Tyr Cys Phe Thr Met He Glu Glu Asp Asp Ser 
50 55 60 

GGA ATC CCT GTT CTC ACC TCT GGA TGT CTA GGA CTA GAA GOG TCA GAT 420 
Gly Met Pro Val Val Thr Ser Gly Cys Leu Gly Leu Glu Gly Ser Asp 
65 70 75 

TTT CAA TGT CCT GAC ACT CCC ATT CCT CAT CAA AGA AGA TCA ATT GAA 468 
Phe Gin Cys Arg Asp Thr Pro He Pro His Gin Arg Arg Ser He Glu 
80 85 90 

TGC TGC ACA GAA AGG AAT GAG TGT AAT AAA GAC CTC CAC CCC ACT CTG 516 
Cys Cys Thr Glu Arg Asn Glu Cys Asn Lys Asp Leu His Pro Thr Leu 
95 100 105 110 

CCT CCT CTC AAG CAC ACA CAT TTT GTT CAT GGG CCC ATA CAC CAC AAG 564 
Pro Pro Leu Lys Asp Arg Asp Phe Val Asp Gly Pro He His His Lys 
115 120 125 

GCC TTC CTT ATC TCT GTG ACT GTC TGT AGT TTA CTC TTG GTC CTC ATT 612 
Ala Leu Leu He Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu He 
130 135 140 

ATT TTA TTC TGT TAC TTC AGC TAT AAA AGA CAA CAA GCC CGA CCT CGG 660 
He Leu Phe Cys Tyr Phe Arg Tyr Lys Arg Gin Glu Ala Arg Pro Arg 
145 150 155 
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TAC ACC ATT GCC CTG CAG CAG GAC GAG ACA TAC ATT CCT CCT GGA GAG 

Tyr Ser II* Gly L*u Clu Gin A»p Glu Thr Tyr XI* Pro Pro Cly Clu 
160 165 170 

TCC CTG AGA GAC TTG ATC CAG CAG TCT CAG AGC TCG GGA AGT GGA TCA 
Set Leu Arg Asp L*u XI* Glu Gin S»r Gin S«r Sor Gly S*r Gly S*r 
175 180.. 185 190 

GGC CTC CCT CTG CTG CTC CAA AGO ACA ATA CCT AAO CAA ATT- CAG ATC 
Gly Leu Pro Leu L*u Val Gin Arg Thr II* Ala Ly* Cln II* Gin Mat 
195 200 205 

GTG AAG CAG ATT GGA AAA GGC CGC TAT CGC GAG CTC TCC ATC CCA AAG 
V*l Ly* Cln II* Cly Ly* Oly Arg Tyr Gly Glu V*l Trp M*t Cly Ly* 
210 215 220 

TCC CCT CCA CAA AAG CTC CCT CTG AAA CTC TTC TTC ACC ACC CAG CAA 
Trp Arg Cly Clu Ly* Val Ala Val Ly* Val Ph* Ph* Thr Thr Clu Clu 
225 230 235 

GCC AGC TCC TTC CCA CAG ACT CAG ATA TAT CAG ACC CTC CTC ATC CGC 
Ala Ser Trp Ph* Arg Glu Thr Glu II* Tyr Gin Thr Val Leu Met Arg 
240 245 250 

CAT GAG AAT ATT CTC GGC TTC ATT CCT CCA CAT ATC AAA CGC ACT GGC 
Hi* Clu Asn II* Leu Cly Ph* He Ala Ala A«p II* Ly* Cly Thr Gly 
255 260 265 270 

TCC TGG ACT CAG TTG TAC CTC ATC ACA GAC TAT CAT CAA AAC GCC TCC 
Ser Trp Thr Cln Leu Tyr Leu He Thr Aap Tyr Hi* Clu A*n Gly S*r 
275 280 285 

CTT TAT GAC TAT CTG AAA TCC ACC ACC TTA GAC CCA AAG TCC ATC CTC 
Leu Tyr Asp Tyr Leu Ly* Ser Thr Thr Leu Asp Ala Ly* S*r M*t Leu 
290 295 300 

AAC CTA GCC TAC TCC TCT CTC AGC CGC CTA TCC CAT TTA CAC ACC GAA 
Lv* Leu Ala Tyr Ser Ser Val Ser Gly Leu Cy* Hi* Leu Hi* Thr Glu 
' 305 310 315 

ATC TTT AGC ACT CAA CGC AAG CCA CCA ATC GCC CAT CCA GAC TTG AAA 

lie Phe Ser Thr Gin Gly Ly* Pro Ala He Ala Hi* Arg Asp Leu Ly* 

320 325 330 

AGT AAA AAC ATC CTG GTG AAC AAA AAT GGA ACT TCC TCC ATA CCA GAC 
Ser Lys Asn XI* Leu Val Lys Lys Asn Gly Thr Cy* Cys II* Ala Asp 
335 340 345 350 

CTC CGC TTC CCT CTC AAC TTC ATT AGT GAC ACA AAT GAG CTT CAC ATC 
Leu Gly Leu Ala Val Ly* Ph* XI* Ser Asp Thr Asn Glu Val Asp XI* 
355 360 365 

CCA CCC AAC ACC CGC CTT GGC ACC AAG CGC TAT ATC CCT CCA GAA GTG 
Pro Pro Asn Thr Arg Val Gly Thr Lys Arg Tyr Met Pro Pro Clu Val 
370 375 380 

CTC GAC GAG AGC TTG AAT AGA AAC CAT TTC CAC TCC TAC ATT ATC CCT 
Leu Asp Glu Ser Leu Asn Arg Asn Hi* Ph* Cln Ser Tyr He Met Ala 
385 390 395 



708 



756 



804 



852 



900 



948 



996 



1044 



1092 



1140 



1188 



1236 



1284 



1332 



1380 
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CAC ATC TAC AGC TTT CCA CTC ATC CTC TOO CAO ATT CCA AGO AGA TGT 1428 
Asp Mot Tyr Ser Phe Gly Leu He Leu Trp Glu He Ala Arg Arg Cys 
400 405 410 

CTT TCT GCA GGT ATA CTC CAA CAA TAC CAC CTT CCC TAT CAC CAC CTQ 1476 
Val Ser Gly Gly I la Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu 
415 420 425 430 

GTG CCC ACT CAC CCT TCT TAT GAG CAC ATG AGA GAA ATT CTC TCC ATC 1524 
Val Pro Ser Asp Pro Ser Tyr Glu Asp Met Arg Glu lie Val Cys Met 
435 440 445 

AAG AAG TTA CCG CCT TCA TTC CCC AAT CCA TGG AGC AGT GAT GAG TGT 1572 
Lys Lye Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys 
450 455 460 

CTC AGG CAC ATG CCC AAG CTT ATG ACA GAG TCC TGG CCC CAC AAT CCT 1620 
Leu Arg Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gin Asn Pro 
465 470 475 

CCC TCC AGG CTC ACC CCC CTC ACA CTT AAG AAA ACC CTT GCC AAA ATG 1668 
Ala Ser Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met 
480 485 490 

TCA GAG TCC CAC GAC ATT AAA CTC TGACCTCACA TACTTCTCCA CAGACCAAGA 1722 
Ser Glu Ser Gin Asp lie Lys Leu 
495 500 

ATTTCACAGA AGCATCCTTA CCCCAACCCT TGAACGTTAC CCTACTGCCC ACTCACTTCA 1782 

GACTTTCCTC GAAGAGACCA CCGTCCCCAG ACACAGAGCA ACCCACAAAC ACCCATTCAT 1842 

CATCCCTTTC TCACCACCAC AAACT G TTT G GGTAACTTCT TCAACATATC ATCCATGTTC 1902 

CTTTCTAAGA AAGCCCTCTA TTTTCAATTA CCATTTTTTT ATAAAAAAAA 1952 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 502 amino acids 
(8) TYPE: amino acid 
(D) TOPOLOCr: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Leu Leu Arg Ser Ser Gly Lys Leu Asn Val Gly Thr Lys Lys Glu 
15 10 15 

Asp Gly Glu Ser Thr Ala Pro Thr Pro Arg Pro Lys lie Leu Arg Cys 
20 25 30 

Lys Cys His His His Cys Pro Glu Asp Ser Val Asn Asn He Cys Ser 

35 40 45 

Thr Aep Gly Tyr Cys Phe Thr Met He Glu Glu Asp Asp Ser Gly Met 
50 55 60 
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Pro val Val Thr Ser Cly Cys Leu Gly Leu Clu Cly Ser Asp Phe Cln 

65 70 75 80 

Cys Arg Asp Thr Pro Zle Pro His Cln Arg Arg Ser Zle Clu Cys Cys 
85 90 95 

Thr Clu Arg Asn Clu Cy» Asn Lys Aap Leu His Pro Thr Leu Pro Pro 
100 105 110 

Leu Lys Asp Arg Asp Phe Val Asp Cly Pro Zle His His Lys Ala Leu 

115 120 12S 

Leu Zle Ser Val Thr Val Cys Ser Leu Leu Leu Val Leu Zle Zle Leu 
130 135 140 

Phe Cys Tyr Phe Arg Tyr Lys Arg Cln Clu Ala Arg Pro Arg Tyr Ser 
145 150 155 160 

Zle Cly Leu Clu Cln Asp Clu Thr Tyr Zle Pro Pro Cly Clu Ser Leu 
165 170 175 

Ara Asp Leu Zle Glu Gin Ser Cln Ser Ser Gly Ser Cly Ser Gly Leu 
180 185 190 

Pro Leu Leu Val Cln Arg Thr Zle Ala Lys Cln Zle Cln Met Val Lys 
195 200 205 

Gin lie Cly Lys Gly Arg Tyr Gly Glu Val Trp Met Gly Lys Trp Arg 
210 215 220 

Gly Glu Lys Val Ala Val Lys Val Phe Phe Thr Thr Glu Glu Ala Ser 
225 230 235 240 

Trp Phe Arg Clu Thr Clu Zle Tyr Cln Thr Val Leu Met Arg His Clu 
245 250 255 

Asn Zle Leu Cly Phe Zle Ala Ala Asp Zle Lys Cly Thr Cly Ser Trp 
260 265 270 

Thr Cln Leu Tyr Leu lis Thr Asp Tyr His Clu Asn Cly Ser Leu Tyr 
275 280 285 

Asp Tyr Leu Lys Ser Thr Thr Leu Asp Ala Lys Ser Met Leu Lys Leu 

290 295 300 

Ala Tyr Ser Ser Val Ser Gly Leu Cys His Leu His Thr Glu Zle Phe 
305 310 315 320 

Ser Thr Cln Gly Lys Pro Ala Zle Ala His Arg Asp Leu Lys Ser Lys 
325 330 335 

Asn Zle Leu Val Lys Lys Asn Cly Thr Cys Cys Zle Ala Asp Leu Gly 
340 345 350 

Leu Ala Val Lys Phe Zle Ser Asp Thr Asn Glu Val Asp Zle Pro Pro 
355 360 365 

Asn Thr Arg Val Cly Thr Lys Arg Tyr Met Pro Pro Clu Val Leu Asp 
370 375 380 
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Glu Ser Leu Asn Arg Asn His Phe Gin Ser Tyr He Met Ala Asp Met 

385 350 395 400 

Tyr Ser Phe Gly Leu He Leu Trp Glu He Ale Arg hrg Cys Val Ser 
405 410 415 

Gly Gly He Val Glu Glu Tyr Gin Leu Pro Tyr His Asp Leu Val Pro 
420 425 430 

Ser Asp Pro Ser Tyr Glu Asp Met hrg Glu He Val Cys Met Lys Lys 
435 440 445 

Leu Arg Pro Ser Phe Pro Asn Arg Trp Ser Ser Asp Glu Cys Leu Arg 
450 455 460 

Gin Met Gly Lys Leu Met Thr Glu Cys Trp Ala Gin Asn Pro Ala Ser 
465 470 475 480 

Arg Leu Thr Ala Leu Arg Val Lys Lys Thr Leu Ala Lys Met Ser Glu 
485 490 495 

Ser Gin Asp He Lye Leu 
500 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE0NESS: single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGGATCCTG TTGTGAAGCN AATATGTG 28 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCGATCCGTC GCAGTCAAAA TTTT 24 

(2) INFORMATION FOR SEQ ID NOi 21 1 

(i) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCGGATCCGC CATATATTAA AAGCAA 26 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGGAATTCTG CTCCCATATA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 basa pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NOt 23i 
ATTCAAGGCC ACATCAACTT CATTTGTCTC ACTGTTG 

(2) INFORMATION FOR SEQ ID NOt 24 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS s single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE 1 NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCGCATCCAC CATGGCGGAG TCGGCC 26 



37 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACACCGGGC CGGCGATGAT 20 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION* SEQ ID NOt 26t 

Cly Xaa Cly Xaa Xaa Cly 

1 5 

(2) INFORMATION FOR SEQ ID NO: 27 1 

(1) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 S 

Aep Phe Lya Ser Arg Asn 

1 5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: paptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Leu Lya Ser Lya Aan 

1 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29s 

Gly Thr Lya Arg Tyr Met 
1 5 
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CLAIMS 

1. An isolated protein having a serine/threonine kinase 
domain, a DFKSRK or DLKSKN sequence in subdomain VIB and/or 
a GTKRYM sequence in subdomain VIII. 
5 2. A protein according to claim 1, which additionally 
comprises an ATP-binding -sequence that is Gly-Xaa-Gly-Xaa- 
Xaa-Gly in subdomain I, and a Lys residue in subdomain II. 

3. An isolated protein having a serine/threonine kinase 
domain which has more than 50% identity to the kinase 

10 domain of any of the amino-acid sequences identified herein 
as SEQ ID. Nos. 2, 4, 6, 8, 10, 12, 14, 16 and 18. 

4. A protein according to claim 3, wherein the identity 
is more than 60%. 

5. A protein according to any preceding claim, having 
15 serine/ threonine kinase activity. 

6. An isolated protein having all or part of any of the 
amino-acid sequences identified herein as SEQ. ID Nos. 2, 
4, 6, 8, 10, 12, 14, 16 and 18, and activin receptor type 
I functionality. 

20 7. An isolated protein having an amino-acid sequence 
corresponding to part or all of the amino-acid sequence of 
an activin type I receptor, and wherein the protein has at 
least one of the following characteristics : - 

(i) serine/threonine kinase activity; 
25 (ii) activin-binding activity; and 

(iii) activin type II receptor interaction. 

8. An isolated protein having all or part of any of the 
amino-acid sequences identified herein as SEQ. ID Nos. 2, 
4, 6, 8, 10, 12, 14, 16 and 18, and TGF-B-type I receptor 

30 functionality. 

9. An isolated protein having an amino-acid sequence 
corresponding to part or all of the amino-acid sequence of 
a TGF-B-type I receptor, and wherein the protein has at 
least one of the following characteristics: 

35 (i) serine/threonine kinase activity; 

(ii) TGF-B-binding activity; and 

(iii) TGF-B-type II receptor interaction. 
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10. A protein according to any of claims 1 to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ 10 No. 2. 

11. A protein according to any of claims 1 to 7, having 
5 all or part of the amino-acid sequence identified herein as 

SEQ ID No. 4. 

12. A protein according to any of claims 1 to 5, having 
serine/ threonine kinase activity and all or part of the 
amino-acid sequence identified herein as SEQ ID No. 6. 

10 13. A protein according to any of claims 1 to 7, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 8. 

14. A protein according to any of claims 1 to 5, 8 and 9, 
having all or part of the amino-acid sequence identified 

15 herein as SEQ ID No. 10. 

15. A protein according to any of claims 1 to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 12. 

16. A protein according to any of claims 1 to 5, having 
20 all or part of the amino-acid sequence identified herein as 

SEQ ID No. 14. 

17. A protein according to any of claims 1 to 7, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 16. 

25 18. A protein according to any of claims 1 to 5, having 
all or part of the amino-acid sequence identified herein as 
SEQ ID No. 18. 

19. A protein acording to any preceding claim, that is a 
soluble receptor. 
30 20. An antibody which binds specifically to a protein as 
defined in any of claims 1 to 19 and not to at least one 
other such protein. 

21. An isolated nucleic acid molecule which codes for, or 
is complementary to a nucleic acid molecule which codes 

35 for, a protein as defined in any of claims 1 to 19. 

22. A recombinant nucleic acid molecule comprising at 
least two heterologous sequences, one of which codes for, 
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or is complementary to a nucleic acid molecule which codes 
for, a protein as defined in any of claims 1 to 19 • 
23. A molecule according to claim 21 or claim 22, wherein 
the protein is a TGF-fl-type I receptor • 
5 24 • A molecule according to claim 21 or claim 22, wherein 
the protein is an activin receptor. 

25. A DNA or RNA/mRNA molecule according to any of claims 
21 to 24. 

26. A molecule according to any of claims 20 to 24, which 
10 additionally comprises, operably associated with the coding 

sequence, a sequence adapted to allow expression of the 
protein. 

27. A host comprising a molecule according to claim 26, 
which is capable of expressing the protein. 

15 28. A host according to claim 27, which comprises PAE 
cells. 

29. A host according to claim 27 or claim 28, transfected 
with the Chim A receptor plasmid. 

30. A product according to any preceding claim, for 
20 therapeutic or diagnostic use. 

31. Use of a product according to any of claims 1 to 29, 
for the manufacture of a medicament for use in treating a 
condition associated with TGF activity. 
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